CHARACTERIZATION OF THE OPTIMAL PLANS FOR THE 
MONGE-KANTOROVICH TRANSPORT PROBLEM 

CHRISTIAN LEONARD 

Abstract. We present a general method, based on conjugate duality, for solving a con- 
vex minimization problem without assuming unnecessary topological restrictions on the 
constraint set. It leads to dual equalities and characterizations of the minimizers without 
constraint qualification. 

As an example of application, the Monge-Kantorovich optimal transport problem is 
solved in great detail. In particular, the optimal transport plans are characterized with- 
out restriction. This characterization improves the already existing literature on the 
subject. 
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1. Introduction 

Although the title highlights Monge-Kantorovich optimal transport problem, the aim 
of this paper is twofold. 

• First, one presents an "extended" saddle-point method for solving a convex mini- 
mization problem: It is shown how to implement the standard saddle-point method 
in such a way that topological restrictions on the constraint sets (the so-called con- 
straint qualifications) may essentially be removed. Of course, so doing one has to 
pay the price of solving an arising new problem. Namely, one has to compute the 
extension of some function; this may be a rather difficult task in some situations, 
but it will be immediate in the Monge-Kantorovich case. This method is based 
on conjugate duality as developed by R.T. Rockafellar in [7j. Dual equalities and 
characterizations of the minimizers are obtained without constraint qualification. 

• Then, these "extended" saddle-point abstract results are applied to the Monge- 
Kantorovich optimal transport problem. In particular, the optimal plans are char- 
acterized without any restriction. This characterization improves the already ex- 
isting literature on the subject. 
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Other applications of the extended saddle-point method are investigated by the author 
in 0] in connection with entropy minimization. 

The Monge-Kantorovich transport problem. Let us take A and B two Polish (sep- 
arable complete metric) spaces furnished with their respective Borel u-fields, a lower 
semi continuous (cost) function c : AxB — ► [0, oo] which may take infinite values and 
two probability measures \i G Va and v G Vb on A and B. We denote Va,Vb and Vab 
the sets of all Borel probability measures on A, B and AxB. The Monge-Kantorovich 
problem is 

minimize ir G Vab | — > / c(a, 6) ir(dadb) subject to n G ^) (MK) 

J AxB 

where P(/i, z/) is the set of all 7r G Pas with prescribed marginals 7r^ = /x on A and 
-Kb = v on B. Note that c is measurable since it is lower semicontinuous and the integral 
I AxB C( ^ 7T e IP> °°] * s well-defined since c > 0. 

For a general account on this active field of research, see the books of S. Rachev and 
L. Riischendorf and C. Villani PUE]. 

Definition 1.1 (Optimal plan). One says that ix G P(^,u) is an optimal plan if it 

minimizes 7 1— > j AxB cdj on v) and / A><B cdn < 00. 

It is well-known that there exists at least an optimal plan if and only if there exists some 
7r° G P(fi, v) such that j AxB cdir° < 00; this will be recovered at Theorem 13.21 Definition 
ll.ll throws away the uninteresting case where f^g cdn = 00 for all 7r G P(/x, Note also 
that, since Monge-Kantorovich problem is not a strictly convex problem, infinitely many 
optimal plans may exist. 

Already existing optimality criteria. Some usual criteria are expressed in terms of cyclical 
c-monotonicity. 

Definition 1.2 (Cyclically c-monotone plan). A subset T C AxB is said to be cyclically 
c-monotone if for any integer n > 1 and any family (a±, 61), . . . , (a n , b n ) of points in T, 
J^r=i c ( a «' bi) — Y^4=i c ( fli ' with convention b n+ \ = b%. 

A probability measure 71 G Vab is said to be cyclically c-monotone if it is concentrated on 
a measurable cyclically c-monotone set T, i.e. vr(r) = 1. 

This notion goes back to the seminal paper [S] by L. Riischendorf where the standard 
cyclical monotonicity of convex functions introduced by Rockafellar has been extended in 
view of solving Monge-Kantorovich problem. 

While completing this paper, the author has been informed of the recent work P by 
W. Schachermayer and J. Teichman who have improved previous characterization criteria 
in several directions. The following definition introduced in P is useful to state 0's 
results in a concise way. 

Definition 1.3 (Strongly c-monotone plan). A transport plan 71 G P(fj,,u) is called 
strongly c-monotone if there exist two measurable functions if and ip on A and B taking 
their values in [—00, +00) such that 

( ip © ip < c everywhere , ^ 

1 (p ® if) = c Ti-almost everywhere. ' 

Here and below, we denote <p © xjj (a, b) = if (a) +■0(6). 
One easily shows that a strongly c-monotone plan is cyclically c-monotone. 
The main results of |9| are collected in the next two theorems. 
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Theorem 1.5 ( t 9j). Let c be a lower semicontinuous nonnegative finitely-valued function. 
If there exists some tt° G P{[i, v) such that cdix° < oo, then for any tt G P(/j,,v), the 
following three statements are equivalent: 

(i) 7r is an optimal plan; 
(ii) ii is cyclically c-monotone; 
(Hi) ii is strongly c-monotone. 

This result significantly improves an already existing criterion (see [TT], Chapter 5) 
where the same conclusion holds with a finitely-valued function c under the following 
constraint qualification: There exist two nonnegative measurable functions ca and c# on 
A and B such that 

c<ca®cb, / ca dpi < oo and / c# aV < oo. (1.6) 

J A JB 

Note that (jl.6|) implies that J AxB cdir < oo for all ti G P(f^, v). It also improves a result 
of L. Ambrosio and A. Pratelli [1] who have shown that, when c is finitely- valued and 
under the moment condition 

fx ((a G A; J B c(a, b) u(db) < oo j) > , _^ 

v ({b G -B; f A c(a, 6) //(da) < oo}) > ^ ' ' 

which is weaker than (|1.6jl . any cyclically c-monotone 7r in P(/i, z/) is both an optimal and 
a strongly c-monotone plan. For (jl.7j) to hold, it is enough that f AxB cdjj, ® v < oo. It is 
also proved in pQ that the functions ip and ip in (II. 4|) can be taken such that G /i) 
and ^ G L^B, u). 

The next result is concerned with cost functions c which may take infinite values. 

Theorem 1.8 (P]IH|). Let c be a lower semicontinuous [0,oo]-valued function. 

(a) Any optimal plan is cyclically c-monotone. 

(b) U 

H ® u{{c < oo}) = 1, (1.9) 

then any optimal plan is strongly c-monotone. 

(c) If there exists some tt° G P(/j,,u) such that fAxB c ^ n ° < 00 > ^ en anv s ^ ron 9k) c ~ 
monotone plan in P(fi, v) is an optimal plan. 

Statement (a) is proved in pQ, while statements (b) and (c) are taken from 

Examples 1.10. 

(1) An interesting example of a cyclically c-monotone plan which is not optimal is ex- 
hibited in pQ , in a situation where c takes infinite values and an optimal plan exists. 
This is in contrast with Theorem II .51 and emphasizes that cyclical c-monotonicity 
isn't the right notion to consider in the general case. 

(2) Take A — B — [0,1], fi(da) = da, u{db) = db the Lebesgue measure on [0, 1] and 
c(a, b) = if a = b and +oo otherwise. Condition (jl.9j) is restrictive enough to 
rule this basic situation out. In the present paper, this restriction is removed. 

A new optimality criterion. Our main results about the optimal plans are Theorems 13.31 
and 13.51 Next theorem sums them up. 

Theorem 1.11. Let c be a lower semicontinuous [0, oo]-v alued function and let ti G 
P(n,v) satisfy Jaxb ^ 71 < 00 • 
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(a) n is an optimal plan if and only if there exist two finitely-valued functions ip G M. A 
and ip G M. B such that 

ip © ip < c everywhere and , , 

<p (B ip = c ir -almost everywhere. ' 

(b) If 7i is an optimal plan, there exist two finitely-valued functions ip G M A and ip G M B 
such that ip G L\ (A, /x) , ip G Li(B,v), 

( \<p Q) 4>\ < c everywhere and (113) 
\ (f (B ip = c on supp 7r P){c < oo}. ^ ' ' 

Remarks 1.14. These results improve previous literature on the subject in several aspects. 

a. No restriction is imposed on c, /i and v. In particular, ()1.9|) is removed. 

b. For the optimality criterion (a), the so-called Kantorovich potentials if and ip are 
finitely-valued and are not required to be a priori measurable. This is in contrast with 
the definition of strongly c-monotone plans. 

c. The analogue of (b) is usually stated as follows: If tt is an optimal plan, there exist two 
[— oo, oo)-valued functions ip G Li(A, /i) and ip G £i(-B, v) such that (|1.12|) holds, even 
in the case where c is required to be finite. The improvements carried by (jl,13j) are: 

- The equality ip © ip = c holds on supp7r fl {c < oo} rather than only tt- almost 
everywhere; 

- The Kantorovich potentials <p and ip are finitely-valued; 

- We obtain \(p © ip\ < c rather than (p © ip < c. 

As an immediate consequence of Theorem 11.111 we obtain the following 

Corollary 1.15. Any it G P(/i, z/) satisfying J AxB cdTT < oo is an optimal plan if and 
only if it is strongly c-monotone. 

But the sufficient condition of Theorem 11.111 is weaker than the strong c-monotonicity, 
while its necessary condition is stronger. 

Finally, let us indicate why considering cost functions c possibly achieving the value +oo 
is a significant extension. In the finite- valued case, the domain of c is the closed rectangle 
AxB. If one wants to forbid transporting mass from A to B outside some closed subset 
S of AxB and only consider the finitely- valued lower semicontinuous cost function c on 
S, simply consider the extended cost function c on AxB which matches with c on S and 
is +oo outside. In this case, c has a closed effective domain. But there are also lower 
semicontinuous functions c whose domain is an increasing union of closed subsets. 

An abstract convex problem and related questions. Monge-Kantorovich problem 
is a particular instance of an abstract convex minimization problem which we present 
now. 

Let U be a vector space, £ = U* its algebraic dual space, $ a (— oo, +oo]-valued convex 
function on U and $* its convex conjugate for the duality (U, C). Let y be another vector 
space, X = y* its algebraic dual space and T : C — » X is a linear operator. We consider 
the convex minimization problem 

minimize subject toTieC, £eC (P) 

where C is a convex subset of X. As is well known, Fenchel's duality leads to the dual 
problem 

maximize inf (y,x) — &(T*y),y G 3^ (D) 

x&C 

where T* is the adjoint of T. 
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What about Monge-Kantorovich problem? We denote Ca, Cb and Cab the spaces of 
all continuous bounded functions on A, B and Ax B; C A , C* B and C* AB are their algebraic 
dual spaces. Taking C = C* AB the algebraic dual of U = Cab, T will be the marginal 
operator T£ = (£a,£b) E X := C* A x C* B which in restriction to those fs in C which 
are probability measures gives the marginals £a on A and £b on B and C will simply be 
{(/!, u)}. Choosing $(y?, if)) = if (p®ip<c and if)) = +00 otherwise, will lead us to 
Monge-Kantorovich problem. 

The usual questions related to (P) and (D) are 

• the dual equality: Does inf(P) = sup(P) hold? 

• the primal attainment: Does there exist a solution I to (P)? What about the 
minimizing sequences, if any? 

• the dual attainment: Does there exist a solution y to (D)? 

• the representation of the primal solutions: Find an identity of the type: £ G 
d$(T*y). 

We are going to answer them in terms of some extension $ of $ under the weak assumption 

T- 1 (C)ndiffdom$*/0 (1.16) 

where 

diffdom$* = {£ E C- d c *®*{£) ± 0} 
is the subset of all vectors in £ at which $* admits a nonempty subdifferential with 
respect to the algebraic dual pairing (£, £*) where £* is the algebraic dual space of £. 
Note that by the geometric version of Hahn-Banach theorem, the intrinsic core of the 
effective domain of the objective function $* : icordom <£>*, is included in diffdom$*. 
Hence, a useful criterion to get is 

T- 1 (C)nicordom$*^0. (1.17) 

The drawback of such a general approach is that one has to compute the extension $. 
In specific examples, this might be a difficult task. The extension $ is made precise at 
Section El for Monge-Kantorovich problem. Another important example of application 
of our general results is the problem of minimizing an entropy functional under a convex 
constraint. This is worked out by the author in jlj with probabilistic applications in mind; 
it is based on the explicit expression of the corresponding function $. 

The restriction (|1.17|) seems very weak since icordom $* is the notion of interior which 
gives the largest possible set. As T~ 1 (C)ndom $* = implies that (P) has no solution, the 
only case where the problem remains open when icordom $* is nonempty is the situation 
where T _1 (C) and dom$* are tangent to each other. This is used in jl] to obtain general 
results for convex integral functionals. 

Nevertheless, the Monge-Kantorovich optimal transport problem provides an interesting 
case where the constraints never stand in icordom $* (see Remark I3.18|) so that (|1.17|) is 
useless and (|1.16|) is the right assumption to be used. 

The strategy. A usual way to prove the dual attainment and obtain some representation 
of the primal solutions is to require that the constraint is qualified: a property which 
allows to separate the convex constraint set T _1 (C) and the level sets of the objective 
function. The strategy of this article is different: one chooses ad hoc topologies so that 
the level sets have nonempty interiors. This also allows to apply Hahn-Banach theorem, 
but this time the constraint set is not required to be qualified. We take the rule not to in- 
troduce arbitrary topological assumptions since (P) is expressed without any topological 
notion. Because of the convexity of the problem, one takes advantage of geometric easy 
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properties: the topologies to be considered later are associated with seminorms which are 
gauges of level sets of the convex functions $ and $*. They are useful tools to work with 
the geometry of (P). 

It appears that when the constraints are infinite-dimensional one can choose several dif- 
ferent spaces y without modifying the value and the solutions of (P). So that for a small 
space y the dual attainment is not the rule. As a consequence, we are facing the problem 
of finding an extension of (D) which admits solutions in generic cases and such that the 
representation of the primal solution is £ G d&(T*y) where $ is some extension of $. 
We are going to 

• use the standard saddle-point approach to convex problems based on conjugate 
duality as developed by Rockafellar in [7j 

• with topologies which reflect some of the geometric structure of the objective 
function. 

These made-to-measure topologies are associated with the gauges of the level sets of $ 
and $*. 

Outline of the paper. The abstract results are stated without proof at Section|21 Their 
proofs are postponed to Section 0] Section |H] is devoted to the application of the abstract 
results to the Monge-Kantorovich optimal transport problem. Finally, basic results about 
convex minimization and gauge functionals are recalled in the Appendix. 

Notation. Let X and Y be topological vector spaces. The algebraic dual space of X is 

X*, the topological dual space of X is X' . The topology of X weakened by Y is o~(X, Y) 

and one writes {X, Y) to specify that X and Y are in separating duality. 

Let / : X — > [—00, +00] be an extended numerical function. Its convex conjugate with 

respect to (X,Y) is f*(y) = swp xeX {(x,y) — f(x)} G [—00, +00], y EY. Its sub differential 

at x with respect to (X, Y) is d Y f{x) = {y G Y; f(x + £) > f(x) + (y,£),V£ G X}. If no 

confusion occurs, one writes df(x). 

For each point a, e a is the Dirac measure at a. 

Stop saying no, be strict. 1 The function sin x is not negative, but it is not nonnegative. 
It is not decreasing, but it is not nondecreasing. All this does not make much sense and 
is not far from being a nonsense for non-English speaking people. As a convention, we'll 
use the non-English way of saying that a positif function is a [0, oo)-valued function while 
if it is (0, oo)-valued it is also strictly positif. The integer part is a croissant (increasing 
in colloquial English) function and the exponential is also a strictly croissant function. 
Symmetrically, we also use the notions of negatif (negative in colloquial English) and 
strictly negatif, decroissant (decreasing in colloquial English) and strictly decroissant 
functions or sequences. To be coherent, [0, 00) and (—00, 0] are respectively the sets of 
positif and negatif numbers, and e > is also strictly positif. We keep the French words 
not to be mixed up with the usual way of writing mathematics in English. 

2. The abstract convex minimization problem 

In this section we give the statements of the results about the abstract convex minimiza- 
tion problem. The dual equality and the primal attainment are stated at Theorem 12.61 
the dual attainment and the dual representation of the minimizers are stated at Theorems 
12.91 and f2.13l Their proofs are postponed to Section HJ 



This is only a suggestion, not a demanding of the right to write maths differently. 
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2.1. Basic diagram. Let y be a vector space and X = y* its algebraic dual space. It is 
useful to define the constraint operator T by means of its adjoint T* : y — > £* (£* is the 
algebraic dual space of £), as follows. For all £ G £, x G X, 

We shall assume that the restriction 

r*(y) c u 

holds, where U is identified with a subspace of £* 

{u , c) 



is meaningful. 

2.2. Assumptions. Let us give the list of our main hypotheses. 

(H*) 1- $ : U -> [0, +oo] is convex and $(0) = 

2- Vw G W, 3a > 0, $(ctu) < oo 

3- VwGW,w^0,3£GM,$(» >0 

(# T ) i- r(y)cw 

2- ker T* = {0} 

(He) C\ — C fl A4 is a convex cr(A?i, 3*\)-closed subset of 

The definitions of the vector spaces X\ and which appear in the last assumption are 
stated below at Section 12.31 For the moment, let us only say that if C is convex and 
a(X ,y)-c\osed, then (H c ) holds. 

Comments about the assumptions. 

- By construction, $* is a convex cr(£, W)-closed function, even if $ is not convex. 
Assuming the convexity of $ is not a restriction. 

- The assumption (H$i) also expresses that $ achieves its minimum at u = and 
that $(0) = 0. This is a practical normalization requirement which will allow us 
to build a gauge functional associated with $. More, (H$i) implies that $* also 
shares this property. Gauge functional related to $* will also appear later. 

- With any convex function $ satisfying (H^), one can associate a function $ 
satisfying in the following manner. Because of (H^ 2 ), $(0) is finite and 
there exists £ a G £ such that 4 G <9<J(0). Then, $(u) = $(u) - - $(0), 
i/GW, satisfies (#$i) and $*(£) = $*(£- 4) - $(0), £ G £. 

- The hypothesis (#$3) is not a restriction. Indeed, assuming let us suppose 
that there exists a direction w 7^ such that $(£tt ) = for all real £. Then any 
I G £ such that (£,u a ) 7^ satisfies $*(£) > sup tgR u Q ) = +00 and can't be a 
solution to (P). 

- The hypothesis (Htq) isn't a restriction either: If y\ — y 2 G ker T*, we have 
(T£, yi) = (T£, y 2 ), for all £ G £. In other words, the spaces y and 3^/ker T* both 
specify the same constraint sets {£ G C;T£ — x}. 

The effective assumptions are the following ones. 

- The specific form of the objective function $* as a convex conjugate makes it a 
convex <r(£, W)-closed function. 

- (H$ 2 ) and (He) are geometric restrictions. 



(2.1) 

W* . It follows that the diagram 

(Diagram 0) 
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- (Hti) is a regularity assumption on T. 



2.3. Variants of (P) and (D). These variants are expressed below in terms of new 
spaces and functions. Let us first introduce them. 



The norms \ ■ |$ and \ ■ \ A . Let $±(w) = max(<I>(-u), $(—«)). By (H$i) and (H^ 2 ), {u G 
< 1} is a convex absorbing balanced set. Hence its gauge functional which is 
defined for all u G U by = inf{a > 0; &±{u/a)) < 1} is a seminorm. Thanks to 
hypothesis (H§ 3 ), it is a norm. 
Taking {Hti) into account, one can define 

A(y)^^T*y),yey. (2.2) 

Let A±(y) = max(A(?/), A(— y)). The gauge functional on y of the set {y G y; A±(y) < 1} 
is \y\\ — inf{a > 0; A±(y/a) < l},y G y. Thanks to and (Ht), it is a norm and 

\y\ A = \T*y\*, yey. (2.3) 



The spaces. Let 

Wi be the | • ^-completion of IA and let 

£1 = (U, I ■ |$)' be the topological dual space of (U, \ ■ |$). 

Of course, we have (Ui, \ ■ |$)' = £iC£ where any £ in U[ is identified with its restriction 
to IA. Similarly, we introduce 

y± the I • |A-completion of y and 

Xi = (y, I • I a)' the topological dual space of (y, | • |a)- 

We have (3\, | ■ |a)' — X x (Z X where any x in y[ is identified with its restriction to y. 
We also have to consider the algebraic dual space C\ and X{ of C\ and X\. 

The adjoint operators ofT. It will be proved at Lemma f4. II that 

TC X C X l (2.4) 

Let us denote 7\ the restriction of T to C\ C C By ()2.4|) . we have T\ : L\ — > X\. Let us 
define its adjoint T 2 * : X* — > £* for all a; G A^* by: 

(£,T» £l , £t = (T x l,u>) XliX *,W G £1. 

This definition is meaningful, thanks to (|2.4|) . We denote T* the restriction of T 2 * to 
C XI . Of course, it is defined for any y G 3^i, by 

{l,TZy) c „ci = {y,Tl)y uXl , We4 

It will proved at Lemma f4. II that 

T^i C U x (2.5) 

We have the inclusions y C C X*. The adjoint operators T* and T* are the restrictions 
of T* to y and ^i- 



OPTIMAL TRANSPORT PLANS 9 

Some modifications o/$ and A. The convex conjugate of $ the dual pairing (U,£) is 

$*(£) = sup{(u,£) -®(u)},£ E £ 

We introduce the following modifications of $ : 

$o(«) = sup{(u,£) -$*(£)}, u GW 

$i(«) = sup{(u,£) -$*(£)},« GWi 
eeCi 

$ 2 (C) = su P {(£,C)-^W},Ce£t. 

They are respectively a(U,£), a{Ui, C\) and cr{C\, £i)-closed convex functions. It is 
immediate to see that the restriction of $2 to U\ is $1. As C\ = U[, $1 is also the | ■ In- 
closed convex regularization of $. The function $2 is the extension $ which appears in 
the introductory Section [T] 
We also introduce 



A (y) = ® (T*y), yey 

AM ± $i(T*y), yey 1 

A 2 (co) = $ 2 (T», u E X* 

which look like the definition (j2.2j) . Note that thanks to (Hti) and ()2.5|) . the first equalities 
are meaningful. Because of the previous remarks, the restriction of A 2 to is Ai. 

The optimization problems. Let $q and $^ be the convex conjugates of $0 an d $1 with 
respect to the dual pairings (U,C) and (U%,£i) : 

%(£) = sup{(«,^>-$ («)}^e£ 

= sup{(u,£) -l^yeA 

and Aq,A* be the convex conjugates of A , Ai with respect to the dual pairings (y,X) 
and (y 1 ,X 1 ): 

A* (x) = sup{(y,x) - A (y)},x E X 
yey 

Al(x) = sup{(y,x) - Ai (y)}, x E X 1 
yeVi 

Finally, denote 

d = C7 n Af a . 
The optimization problems to be considered are 



minimize <$>*(£) 




subject to T£ E C, 


£eC 


(P) 


minimize 




subject to T£ E Ci, 


£Ed 


(Pi) 


minimize Aq(x) 




subject to x E Ci, 


x E X 1 


( p l,x) 


maximize inf (y, x) - 


- Ao(y), 


yey 




(Do) 


maximize inf (y, x) 


-Ai(y), 


yey t 




Pi) 


maximize inf (x, 


-A 2 (w), 


io E X{ 




(D2) 
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2.4. Statement of the abstract results. We are now ready to give answers to the 
questions related to (P) and (D) in an abstract setting. 

Theorem 2.6 (Primal attainment and dual equality). Assume that (H$) and (Ht) hold. 

(a) For all x in X, we have the little dual equality 

inf {$*(£); £ G C,T£ = x} = A* (x) G [0,oo]. (2.7) 

Moreover, in restriction to X ± , Aq = and A{ is a(X 1 ,y 1 )-inf-compact. 

(b) The problems (P) and (Pi) are equivalent: they have the same solutions and 
inf(P) =inf(Pi) g [0,oo]. 

(c) If C is convex and a(X ,y) -closed, we have the dual equality 

inf(P) = sup(Po) e [0,oo]. 
Assume that (H$), (Ht) and (He) hold. 

(d) We have the dual equalities 

inf(P) = inf (Pi) = sup(£>i) = sup(P> 2 ) = inf A* (x) G [0, oo] (2.8) 

(e) If in addition inf(P) < oo, then (P) is attained in C\. Moreover, any minimizing 
sequence for (P) has a(Ci,Ui) -cluster points and every such cluster point solves 

(!■>)■_ 

(f) Let £ G C\ be a solution to (P), then x = T£ is a solution to (Pi x) and inf (P) = 
$*(/)= A* (x). 

Theorem 2.9 (Dual attainment and representation. Interior convex constraint). Assume 
that (H§), (Ht) and (H c ) hold and also suppose that the interior constraint qualification 

CnicordomA* ^ (2.10) 
is satisfied. Then, the following statements hold true. 

(a) The primal problem (P) is attained in C\ and the dual problem (D 2 ) is attained in 
X* 

(b) Any £ G C\ is a solution to (P) if and only if there exists cu G X* such that the 
following three statements hold 

(1) TleC 

(2) (T£, u) < (x, Q) for all x G d 

(3) £ e d Cl <5> 2 (T*u) 

More, these three statements hold if and only if: £ is a solution to (P), u is a solution 

to (D 2 ) and inf(P)=sup(P 2 )- 

It is well-known that the representation formula 

£ e dc&ffio) (2-H) 

is equivalent to Young's identity 

+ $ 2 (T 2 *cu) = (T£,Cu). (2.12) 

(c) Any solution uj of (D 2 ) shares the following properties 

(1) Co stands in the a -(X* , Xi)- closure o/domAi. 

(2) T^uj stands in the a ■(£*, C\) -closures o/T^domAx) and dom$. 

(3) For any x Q in C fl icordomA*, uj is jD Xo -upP er semicontinuous and j_E> Xo -lower 
semicontinuous at 0, where jo Xo and j-D Xo are the gauge functionals on X\ of the 
convex sets D Xo — {x G X\\ Aq(x g + x) < A\(x Q ) + 1} and —D Xo . 
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As will be seen at Section El the Monge-Kantorovich problem provides an important 
example where no constraint is interior (see Remark I3.18j) . In order to solve it without 
imposing constraint qualification, we are going to consider the more general situation 
(jl.lfij) where the constraint is said to be a subgradient constraint. This means that x G 
diffdomAg with 

diffdomA* = {x G Xi,dx*A.* (x) ^ 0} where 
d x *A* Q (x) = {u G X*;A*(x') > A*(x) + (x' - x,u),W G X\\. 

Two new optimization problems to be considered are 

minimize $*(£) subject to T£ = x, £ G C (P x ) 

maximize (x , u) — A 2 (u)) , u G X* (D x ) 

where x G X. This corresponds to the simplified case where C is reduced to the single 
point x. 

Theorem 2.13 (Dual attainment and representation. Subgradient affine constraint). Let 
us assume that (H$) and (H?) hold and suppose that x G domAp. Then, inf(P x ) < oo. If 
in addition, 

x G diffdomA*, (2.14) 
then the following statements hold true. 

(a) The primal problem (P x ) is attained in C\ and the dual problem (D x ) is attained in 
X*. 

(b) Any I G £i is a solution to (P x ) if and only ifT£ = x and there exists uj G X* such 
that \2.11\) or equivalently holds. 

More, this occurs if and only if: £ is a solution to (P), Co is a solution to (D x ) with 
x:=T£ and inf(P*) = sup( J Df ). 

(c) Any solution u of {D x ), shares the following properties 

(1) uj stands in the a (X* , X{) -closure o/domAi. 

(2) T^oj stands in the a (£*, C\)- closures o/T x *(domAi) and dom$. 

(3) Let uo be any solution of (D x ) with x G icordomAQ. Then, u is jn- -upper semi- 
continuous and j-D- -lower semicontinuous at where D% = {x E X x ; Aq(x + x) < 
A* Q (x) + 1}. 



3. Application to the Monge-Kantorovich optimal transport problem 

We apply the results of Section [21 to the Monge-Kantorovich problem. Recall that we 
take A and B two Polish spaces furnished with their Borel a-fields. Their product space 
Ax B is endowed with the product topology and the corresponding Borel cx-field. The 
lower semicontinuous cost function c : Ax B — > [0, oo] may take infinite values. Let us also 
take two probability measures \x G Va and v G Vb o n A and B. The Monge-Kantorovich 
problem is 

minimize ti G Vab / c(a, b) Ti(dadb) subject to n G P(fJ>, v) (MK) 

J AxB 



where P(fi, v) is the set of all 7r G Vab with prescribed marginals tta = n on A and ttb = v 
on B. 
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3.1. Statement of the results. Let us fix some notations. We denote Ca, Cb and 
Cab the spaces of all continuous bounded functions on A, B and Ax B. The Kantorovich 
maximization problem: 

maximize I (pdu + / ib dv for all o?, ib such that 

J A Jb (K) 

ip G Ca, 4> G C B and tp (B ip < c 

is the basic dual problem of (MK). We also consider the following extended version of 
(K): 

maximize / cpdu + / ib dv for all 02 G M" 4 , ib G IR B such that 

Ja Jb (K) 

(p G Lx(A, //), if) G Li(B, v) and © ^ < c everywhere on AxB. 

Remark 3.1. The real- valued function p G M A is defined everywhere, rather than //-almost 
everywhere, and (p G £i(v4, /i) implies that it is ^-measurable. This means that there exists 
some measurable set iV^ C A such that /j,(Na) = and ljv^V i s measurable. A similar 
remark holds for ib. 

The set of all probability measures 7r on AxB such that J AxB cdn < 00 is denoted P c . 
By Definition II. 1[ an optimal plan stands in V c . In the next theorem, V c will be endowed 
with the weak topology cr(V c , C c ) where C c is the space of all continuous functions u on 
AxB such that \u\ < k(l + c) for some k > 0. 

Theorem 3.2 (Dual equality and primal attainment). 

(1) The dual equality for {MK) is 

M(MK) = sup(K) = sup(J?) G [0, 00]. 

(2) Assume that there exists some ir° in P(fi,v) such that Jaxb ^" < 00 • Then: 

(a) There is at least an optimal plan and all the optimal plans are in V c \ 

(b) Any minimizing sequence is relatively compact for the topology a(V c ,C c ) and 
all its cluster points are optimal plans. 

This result is well-known. The dual equality inf(MK) = sup(JT) = sup(JT) is the 
Kantorovich dual equality. The proof of Theorem 13.21 will be an opportunity to make 
precise the abstract material $, U, T . . . in terms of the Monge-Kantorovich problem. 

Next, we state the characterization of the optimal plans without restriction. 

Theorem 3.3 (Characterization of the optimal plans). 

(1) A probability measure 7r G Tab is an optimal plan if and only if there exist two 
finitely-valued functions (p G M. A and if) G M B such that 

ti a = /i, tt b = v, J^cdn < 00, 

<p © if) < c everywhere and (3-4) 
<p © if) = c iT-almost everywhere. 

(2) Let ip and if) be finitely-valued functions on A and B and let be tt G Vab- 

(a) If ip is fi-measurable andib is v-measurable, the following statements are equiv- 
alent: 

- <p, if> and Ti satisfy \cS-4\l ; 

- -k is an optimal plan and (ip,if>) is a solution of(K). 

(b) In the general case where (p and if) are not assumed to be measurable, consider 
the following statements: 
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(i) (p, ijj and tt satisfy \3.Jj\) ; 

(ii) \sip, and tt satisfy \3.4\) ; 

(Hi) tt is an optimal plan and (\s if, Is ip) is a solution of (K). 
Then: (i) =>- (ii) (in). 

This new result improves the already existing literature on the subject. It is important 
to note that the functions f and ip satisfying \3.4\j are neither assumed to be integrable 
nor to be measurable. Next theorem shows that they can be further specified. 

Theorem 3.5 (More about necessary conditions). Assume that tt is an optimal plan. 
Then, there exist two finitely-valued functions ip G ffi A andip G M. B such that if G Li(A,/i), 
ip G Li(B, v) and 

J |y©Vl — c everywhere and 

\ if © ip = c on supp tt P){c < 00} . 

Clearly, (<p,ip) is a maximizer of (K). 

Remarks 3.6. 

a. Note that any optimal plan tt is satisfies supp7r C cl{c < 00}. 

b. Recall that tt is said be to concentrated on the measurable set V if vr(r) = 1. For 
instance, ()3.4l -c) is equivalent to the existence of some set V on which tt is concentrated 
and if @ip = c on T. The support of tt, denoted supp7r, is the closure of the union of 
all the sets T on which tt is concentrated. 

3.2. Proof of Theorem 13. 2L We apply the general results of Section |21 



The operators T and T*. The algebraic dual spaces of Ca, Cb and Cab are C* A , C B and 
C* AB . We define the marginal operator 

Te=(i A ,£ B )eC A xC B , £eC* AB 

where (if, £ A ) = (if <g> 1, i) and (ip, £ B ) = (1 <g) ■0, £) for all if G C A and all ip G C B . 
Let us identify the operator T*. For all (<f,i/i) G C A x C B and all £ G C, we have 
(T*(if,ij),l) c * >c = ((<f,i>),(e A ,e B )) = (<f,e A ) + (i>,£ B ) = (if@^,l) u ,c where^©^(a,6) = 
ip (a) + ip(b). Hence, for each ip G C A and ip G C B , 

T*(ip,^)=if®^eC AxB . (3.7) 



The problem (P). Then, the Diagram is built with U = Cab, £ = C AB , X = C\ x C* B 
and y = Ca xCb. Here and below, we denote the convex indicator function of the set X, 

£ / x _ J if x G X 
x ^ 1^ +00 otherwise. 

Choosing C = {(/z, z/)} and $(ii) = 5{ u < c },^ G Cab we get $*(£) = sup{(u,£);u G 
Cab, m < c}, £ G and we obtain the primal problem 

minimize $*(£) subject to £a = ^ and £3 = v, £ G C* AB . (P) 

It will be shown at Proposition 13. 12l that the corresponding problem (Pi) is (MK). 
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The problem (D ). Now, let's have a look at $ - As {u G Cab',u < c} is convex and 
o~{Cabi C^ B )-closed, we have <3> = Therefore, for each ip G and ip G Cg, 

Ao(<p,i(>) = H<p,ip) = ®(T*((p,ip)) = 5 {ip ^< c} 

and the dual problem is 

maximize / (pdfi + / ip dv subject to <p © ip < c, <£> G CU, */> G Cs (D ) 
J a J B 

whose value is 

A*(fi, v) = sup <yj ipdfi + J ip dv\ ip G Ca, ip G C B (f ® ip < c j 

As A = A and A^ = AJ ( Theorem ICTa). we have: Ag = A* = A*. 
T/ie hypotheses (H). We begin with a simple remark. 

Remark 3.8. One can choose c > 1 without loss of generality. Indeed, with c > 
taking c = c + 1 one obtains Jaxb ^ = Iaxb^^ 17 ~ ^ ^ or a ^ n e ^ab- Consequently, 
the minimization problems (MK) and (MK) associated with c and c share the same 
minimizers and their values are related by 'mi(MK) = mf (MK) — 1. 
It also follows from these considerations that our results still hold under the assumption 
that c is bounded below rather than c is positif. 

We assume from now on that c > 1. This guarantees (H<p 2 )- in the case where c is 
finitely valued, the remaining hypotheses (H) follow by (|3.7|) and direct inspection. 

If c is infinite somewhere, then (H^) fails. Indeed, for any function u G Cab, we have 
$(tu) = for all real t if and only if {u ^ 0} C Ax B \ S where 

S = cl {c < oo} 

is the closure of {(a, b) G Ax B; c(a,b) < oo}. The way to get rid of this problem is 
standard. Let u ~ v be the equivalence relation on R^ 4 * 8 defined by u\s = v\s, i.e. u and 
v match on S. The space U to be considered is the factor space 

U := Cab/ ~ 

Clearly, if it ~ t> then $(tt) = $(u). Hence, it is possible to identify without loss of 
generality any u G Cab with its equivalence class which in turn is identified with the 
restriction u\s of u to S. 

The problem (Pi)- Recall that c > 1 without loss of generality. Let us first identify the 
space Ci. As $±(u) = £{|«|< c }, we obtain the seminorm \u\<s> = sup \u/c\ := ||w|| c on Cab 
which becomes a norm on 14, 

Ui = C c := {u\g] u : Ax B — > R, ?i continuous and |tt| < fee for some real /c} 

£l = (Cc, || " ||c)' = C' c . 

Obviously, any 7r in V c has its support included in S and belongs to C' c with the dual 
bracket (u\s,ir) = j s udir, u\$ G C c . In what follows, it will be written equivalently 

• u\s G C c to specify that the equivalence class of u stands in C c and 

• u G C c to specify that the restriction u\s of the continuous function u on A x B 
stands in C c . 
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Clearly, the function $x is 

$i(u) = 5 {u < c} , ueC c 
and the modified primal problem is 

minimize subject to £a = £t afl d ^b = v , £ G C£ (Pi) 

where for each £ E C' c , 

$l(£) = sup{(u,£);u eC c ,u< c}. 
Remark 3.9. Two representations of C^. 

a. Let C C (S) be the space of all continuous functions won5 (w.r.t. the relative topology) 
such that ||iw|| c = sup 5 |w/c| < oo and C C (S)' be the topological dual space of the 
normed space (C C (<S), || • || c ). Let £ be the subspace of all functions in C C (S) which can 
be continuously extended to the whole space Ax B. There is a one-one correspondence 
between C' c and the dual space (£, \\ ■ || c )'. 

b. There is also a one-one correspondence between C' c and the space of all linear forms £ 
on the space of all continuous functions on A x B such that supp £ C S (see Definition 
13. 1UI below) and sup{(u, I) = (u\s, £);u : \\u\\ c < 1} < oo. 

Definition 3.10. For any linear form £ on the space of all continuous functions on Ax B, 
we define the support of £ as the subset of all (a,b) G Ax B such that for any neighborhood 
G of (a, b), there exists some function u in Cab satisfying {w ^ 0} C G and (u, £) ^ 0. It 
is denoted supp £. 

Definition 3.11. 

(a) One says that £ G C' c acts as a probability measure if there exists £ G Vab such that 
supp£ C S and for all u G Cab, (u\s,£) — fs u d£- I n this case, we write: £ G Vs- 

(b) One says that £ G C' c stands in V c if there exists £ G V c such that for all u G C c , 
(u\s,£) = f s ud£. In this case, we write: £ G V c . 

Of course, if there exists £ satisfying (a), it belongs to V c and is unique since any 
probability measure on a metric space is determined by its values on the continuous 
bounded functions. This explains why the notation £ G V c in (b) isn't misleading. 
Note also that any probability measure £ G V c has a support included in S. Since A x B 
is a metric space, for any £ G V c acting as a measure, supp£ in the sense of Definition 
13. 1UI matches with the usual support of the measure £. 

Completing the proof of Theorem \3. 6 A The full connection with the Monge-Kantorovich 
problem is given by the following Proposition 13 . 121 Clearly, with this proposition in hand, 
Theorem 13.21 directly follows from Theorem 12.61 and the obvious inequalities sup(i^) < 
sup(Z) < M(MK). 

Proposition 3.12. For all £ G C' c , 

(a) $*(£) < oo ^£ > 0, 

(b) <$>{(£) < oo =>■ supp£ C S, 

(c) [£> 0, supp £ C S,£a = \i and £q = v] I G Vs and 

(d) for all £ G Vs, = J s cd£. 
It follows that 

- dom$J C V c and 

- the problems (MK) and (Pi) share the same values and the same minimizers. 
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Proof. Clearly, the last statement follows from the first part of the proposition. The proof 
is divided into four parts. 

• Proof of (a). Suppose that £ £ C' c isn't in the positif cone. This means that there exists 
u £ C c such that u > and (u , t) < 0. Since u Q satisfies \u < < c for all A < 0, we 
have $*(£) > sup A<0 {(A-u o , £)} = +oo. Hence, < oo implies that £ > and one can 
restrict our attention to the positif £'s. 

• Proof of (b). Suppose ad absurdum that suppf £ S. Then, there exists a positif function 
u £ Cab such that {u a > 0} n S = and (w , > 0. As tu a < c\axb\s = 00 f° r ah t > 0, 
$*(£)> su Pt>0 {(tw o ,£>} = +oo. 

• Proof of (c). Let us take t > such that supp f C 5, ^ = |i and — v. It is clear that 
(1, £) = 1. It remains to check that for any £ E C' c 

[£ > 0, supp £ C S,£a = H and £b = ^] =>■ ^ is cr-additive, (3.13) 

rather than only additive. Since AxB is a metric space, one can apply an extension 
of the construction of Daniell's integrals ([5], Proposition II. 7.2) to see that £ acts as a 
measure if and only if for any decroissant sequence (u n ) of continuous functions such that 
< u n < 1 for all n and lim n _>ooWn = pointwise, we have lim n ^oo(u n , £) = 0. This 
insures the cr-additivity of £. Note that as supp I C S, for all u £ C c one can shortly write 
(u,£) instead of the meaningful bracket (u\s,£). 

Unfortunately, this pointwise convergence of (u n ) is weaker than the uniform convergence 
with respect to which any £ £ C' c is continuous. Except if Ax B is compact, since in this 
special case, any decroissant sequence of continuous functions which converges pointwise 
to zero also converges uniformly on the compact space S. 

So far, we have only used the fact that AxB is a metric space. We now rely on the 
Polishness of A and B to get rid of this compactness restriction. It is known that any 
probability measure P on a Polish space X is tight (i.e. a Radon measure): for all e > 0, 
there exists a compact set K e C X such that P(X \ K e ) < e (pj, Proposition II. 7.3). 
As in addition a Polish space is completely regular, there exists a continuous function f e 
with a compact support such that < f e < 1 and f x (l — f € )dP < e. This is true in 
particular for the probability measures £ Va an d v £ Vb which specify the constraint 
in (MK). Hence, there exist ip e £ Ca and ip e £ Cb with compact supports such that 
< y?e, ipe < 1 and < j A (l — ip e ) dfi, j B (l — ip e ) dv < e. It follows that any £ £ C' c with 
£a = A* an d £b = v satisfies < ((1 — ip t ® ip t ),£) < 2e. With the following easy estimate 
< (u n ,£) < 2e + (u n (<f e <S> ?pe),£) and the compactness of the support of <p t <g> -^e, one 
concludes that lim n _ +DO (u n ,^) = which proves ()3.13|) . 

• Proof of (d). As c is bounded below and lower semicontinuous on a metric space, it 
is the pointwise limit of a croissant sequence (c n ) of continuous bounded functions. It 
follows from the monotone convergence theorem that for any £ £ Vs, §\{£) = j s cd£. 
This completes the proof of the proposition. □ 

Optimal plan: an overview of the proofs of Theorems 13.31 and 13. 5L The proofs 
of these theorems are postponed to Section 13.61 We first derive preliminary results at 
Sections 13. 3[ 13.41 and 13.51 

At Section I3.3[ the abstract results of Section El are translated in terms of the Monge- 
Kantorovich problem. This is summarized at Theorem 13.241 which states an abstract 
characterization of the optimal plans. This theorem directly results from the extended 
saddle-point method. In particular, the optimal plan tt is related to some linear form 
uj £ X*. It remains to show that uj is the extension of some couple of functions (<f,i^)- 
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This is done at Section l3~4l for the sufficient condition and at Section l3~B1 for the necessary 
condition. The main results of Sections 13.41 and 13.51 are respectively Lemma 13.311 and 
Lemma 13.431 

3.3. Optimal plan: applying the extended saddle-point method. The main result 
of this section is Theorem 13 . 241 which gives an abstract characterization of an optimal plan. 

The space X\. By ()2.3|h we see that \(<p, iP)\a = ||y? © V'llc- This leads to 

X 1 = {(ki,k 2 ) G C* A x C*; |(ki,k 2 )H < oo} 

where |(ki,k 2 )|X = sup{(y2, /%) + (^>, k 2 ); 4>) G 3^i, ||<£> © V'llc < !}• The dual equality 
O gives 

k 2 )|1 = inf {\\£\\* c ; i G C' c : i A = Kl , i B = k 2 ] . 

Note that X\ is the space of all /t 2 ) G C* A x C# such that K\ = l A and k 2 = £b for 
some £ in C' c . Recall that the elements of X\ = y[ are identified with their restriction to 
y which is dense in 3^i- 

Remark 3.14 (The space and the problem (-Di)). The exact description of y% and (Di) 
will not be used later. Nevertheless, as an illustration of our general results, we describe 
them assuming that c is finitely valued. As A(cp,ip) = 5y^< c }i one sees that 

= {(</?, if))] f : A — > E continuous, ^ : B — > E continuous : y> © ^ G C c } 

This result is not as obvious as it seems to be. It follows from an interesting paper [2J 
of J.M. Borwein and A.S. Lewis which studies the convergence of sequences of the form 
(<f n @ip n )n>i- The additive form tp @ip in the expression of y± is proved at ([2], Corollary 
3.5) and the continuity of <p and if) is a consequence of ( [2] , Proposition 5.1). 
The corresponding problem (Di) is 

maximize / Lfdfi+ / if) dv for all p, if) such that 

J A J B 

(p, if) continuous, ip © if) G C c and <p ®if) < c. 

Anyway, we won't use this dual problem since it is sandwiched between (-Do) and (-D 2 ). 

The extension $ 2 . To proceed, one has to compute the extension $ 2 . As it is the greatest 
convex o~(C'*, C£)-lower semicontinuous extension of $, we have 

$2(0 = <%(£), eeC (3-15) 

where T is the <j(C'*, C^-closure of 

r = {u < c} C C7 C . 

Any u G X* is decomposed as uj = (u>a,Ub) where for all (ki,k 2 ) G X\, (u), (ki,k 2 )) = 
(a), 0)) + (a), (0, k 2 )) = (cja, «i) + (wbi ^2) where G X* is seen as the restriction to 
X\ of some linear form u) ow X = C* A x C* B . The adjoint operator T 2 * is defined for all 
uj G X* and £ G C' c by (T 2 V>,£) = (u;a,£a> + (u) B Jb) ■= (uj a ®u b ,£). That is 

T*u) = u) A ®u) B eC' c *. (3.16) 

This yields 

A 2 (uj) = <5|t(cl>a © £J B ), u G Af x * 

and the extended dual problem (D 2 ) is 

maximize {u)a, (J>) + (^b, uj G A"* sucii that uja © wb G V (D 2 ) 
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Note that for this dual problem to be meaningful, it is necessary that (He) holds: i.e. 
(fi, v) G X\. This is realized if (fi, v) G dom A* or equivalently if 'mi(MK) < oo. 

The constraint qualification. One will be allowed to apply Theorem 12.131 under the con- 
straint qualification (|2.14j) : 



Let us give some details on this abstract requirement. 

Remark 3.18. Note that for all \x G Va, v G Vb, (a*, v) $ icordomA* if AxB is an infinite 
set. Indeed, for all 7r G P(fi,u) such that J AxB cd7r < oo, one can find (a ,b ) such that 
with S(a ,b ) the Dirac measure at (a ,b ), it '■= t£(a ,b ) + (1 — t)ir ^ for all t < 0, so 
that $*(£ t ) = +oo ( Proposition 13.121 -a). This shows that [4,4] = [tt, £( ao ,& )] C dom$J 
while 4 G" dom$* for all t < 0. Hence, (/x, z/) G" icordomA* and one has to consider the 
assumption ()3.17j) on (/i, z/) rather than (/i, z/) G icordomA*. 

This is in contrast with the situation encountered in [3] where the rule is x a G icordomA*. 
Lemma 3.19. We have dom A* = diffdomA*. 

Proof. Proposition l3~T2l -a states that dom$^ C C + where C + = {£ G C' c ; i > 0} is the the 
positif cone of C' c . Therefore, $J = $* + 5 C +. Consequently, with (|2.7|) one obtains that 



Suppose ad absurdum that there is some x a G dom A* such that x Q G" diffdomA*. This 
implies that there exists some half-line }x ,x + oo(x' a — x Q )[ on which A* achieves the 
value +oo, which in turn implies that $* must achieve the value +oo somewhere on C + . 
But this is impossible since &*(£) = \\£\\* for all £ G C + . This completes the proof of the 



As a consequence of this lemma, it appears that (J3.17|) is not a constraint qualification. 
One can apply Theorem 12. 131 under the only restriction that inf (MK) < oo. This gives 
the following 

Lemma 3.20. Let us assume that inf (MK) < oo. Then, (P) and (D 2 ) both admit a 
solution in Vab and X*. Furthermore, any (tt,u) G Vab x X* is a solution of (P) and 
(D2) if and only if 



(/i, v) G diffdomA*. 



(3.17) 



A*{x) = inf {$*(£); £g£ + ,T£ = x}, x e X, 



lemma. 



□ 




(3.21) 



As $* and $2 are mutually convex conjugates, (|3.21l -b) is equivalent to 

77 G dentin) 
and also equivalent to Young's identity 



(3.22) 



#[(7r) + $ 2 (r?) = <77,7T> 



(3.23) 



and also equivalent to 



f $ 2 (r/) = 

I (V, TT) = I AxB cd7C - 



In other words: 



Theorem 3.24. Let n G P(fi, v) be such that Jaxb ^ < 00 • Then: 
(1) (D 2 ) admits at least a solution in Xf; 
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(2) ir is an optimal plan if and only if there exists some to G X* such that 

(a) t^uj e r 

(6) (u, (fi,u)) = fjxgcdn; 

With 1] = T 2 *uj, this implies the equivalent statements A3. 2 li b). \3. 22]) or A3. 23)) . 

(3) If such an u exists, it is a solution of (D 2 ) and any other solution of (D 2 ) is also 
convenient. 

This is the core of the extended saddle-point method applied to Monge-Kantorovich 
problem. To prove a practical optimality criterion one still has to translate these abstract 
properties. 

3.4. Optimal plan: preliminary results for the sufficient condition. The next 
lemmas are preliminary results for the proof of a sufficient condition for the optimality. 

Lemma 3.25. Let ip and ip be real functions on A and B. 

(1) The lower semicontinuous regularizations hip and hip of if and ip satisfy 

Is (ip®ip) = hip®hip. 

(2) If f and ip are such that ip © ip = c on some subset T of AxB and <p @ ip < c 
everywhere on AxB. Then, hip and hip still share the same properties. 

Proof. • Proof of (1). For each (a, b) G AxB, 

h(ip®ip)(a,b) = sup inf [ip(a') + ip(b')} 

V&N({a,b)) (a',b')eV 



sup( inf [ip{a')+iP{b%V A eN{a),V B eM{b)\ 

l(a',b')eV A xVB J 

sup j inf ip(a'); V A G M{a)\ + sup j inf iP(b'); V B G N(b) 



= Is cp(a) + Is ip(b) 
where A/"(x) stands for the set of all open neighbourhoods of x. 

• Proof of (2). It is a direct consequence of the lower semicontinuity of c and statement 
(1). □ 

Lemma 3.26. Let n G -P(yU, v) be such that J AxB cdn < 00. Suppose that there exists two 
real-valued functions ip G M. A and ip G M, B such that 

ip © ip < c everywhere , 
ip © ip = c ix-almost everywhere. 

(1) If ip is fi-measurable and ip is v -measurable, then ip G L\(A, fi) and ip G L\(B, v). 

(2) In any case, the real-valued functions hip and hip still satisfy l[3.27\ ) together with 
hip G Li(A,/i) and hip G Li(B, v). 

Proof. • Proof of (1). Let us fix (a D , b Q ) G AxB such that c(a ,b ) < 00 (such a point 
exists since X^^crfvr < 00 for some 7r.) We have ip(a) = c(a,b Q ) — ip{b ) > —ip(b Q ) for 
all a G A and similarly ip > —ip(a Q ). Hence, the integrals J A ipdfi G [— ip{b ), +00} and 
fgipdv G [— ip(a ), +00] are well-defined. Finally, ip G L%(A, fi) and ip G L\{B,v) since 
J A ipdfx + J B ipdu = f^cdir < 00. 
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• Proof of (2). Applying Lemma 13.251 with T a measurable set such that 7r(T) = 1 
yields two lower bounded measurable functions ls<p and Is ip which still satisfy (|3.27j) . 
One concludes as above. □ 

Let T be the a(X*, Ai)-closure of 

T = G C A x C B - V ® xp < c}. (3.28) 

Lemma 3.29. 

(a) For all (a, b) G S, A*(e a , e&) = c(a, b). 

(b) For any u G X*, we have uj G T if and only if (oj, k) < A*(k), Vk G X\. 

(c) T 2 *T C f. 

Proof. • Proof of (a). For any (a, b) G S, A*(e a ,£b) = ^{Jaxb ^ 71 ' 71 e T^ab '■ n A = 
£ a , t^b = £b} = Jj^b cde( a ,b) = c(a, b) where we used the dual equality ()2.7|) and the fact 
that Sub) is the unique plan it with marginals e a and £&. 

• Proof of (b). It is enough to check that for all <p = (ip, ip) in Ca x Cb 

0GT<^[(0,k) <A*(k),VkG X x \. (3.30) 

Young's inequality ((ft, k) < A(</>) + A*(/c), V0, k and G T <^> A(</>) = <5r(0) = give the 
direct implication. For the converse, choosing k = (e a , Eb) in the right-hand side of ()3.3()j) . 
one obtains with the previous statement (a) that ip © ip < c. 

• Proof of (c). It is clear that T*Y C T and one concludes with the a(X^, Xi)-o~(C\, CP)- 
continuity of T 2 * : X* — > £*, see Lemma Ol d. □ 

Lemma 3.31. Let tt G P{fi, v) be such that j AxB c ^' 7T < 00 an< ^ su PP ose that there exist 
two real functions ip in Li(A, ji) and ip in Li(B,v) satisfying \3.21 ). 
Then, there exists some uj in T such that 

{u,(fi, u)) = J^cdTi; 

w ( e oj e fe) — V 9 ( a ) + ^(b) for n-a.e. (a, b) and 
uj(k) < A*(\k\), Vk G X x . 

Proof. There exists a measurable subset T of S such that vr(T) = 1 and <p © if) = c every- 
where on T. Let A D be the vector subspace of X\ spanned by (fi, v) and {(e a , e^); (a, b) G 
T}. It follows from our assumptions on <p and r/> that for all positif k = (k 1; k 2 ) G A , p> 
is in Li(A, Ki) and t/> is in Li(B, k 2 )- Define the linear form uj on A 1 ,, for each k G A by 



Clearly 



Cl> (k) = / ^6^! + / if) dfv2- 
J A JB 

uj (fi,u) = / cdir (3.32) 



.4x8 



and for all positif k G E q , 



< sup < / tpdKi+ / "i>dn2\ (p&Li(A,Ki), if) G L\{B, K\), p>®ip<c 



B 



Denoting (A K ) and (A K ) the analogues of problems (A") and (A) with [k\, k 2 ) instead of 
(fi, v\ this means that 

uj (k) < sup(A K ). 
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The dual equality ()2.8|) states that sup(K K ) = A*(k). As we have already seen at Theorem 
13.21 a that sup(K K ) = swp(K K ), we obtain: sup(K K ) = A*(k). Therefore, we have proved 
that uo (k) < A*(k), for all k G E a , k > 0. As for any k G E a , uj q (k) = j^gCdp for any 
measure p with marginals pa = Ki and ps = «2> one sees that u a is positif. It follows that 

u (k) < A*(\k\), kEE 

where \k\ = (\ki\, |k 2 |) and \Ki\ is the absolute value of the measure 
Note that X\ is a Riesz space since it is the topological dual of a normed Riesz space. 
Hence, any k G X\ admits positif and negatif parts k + and k~ , and its absolute value 
is \k\ = k + + k~. This allows to consider the positively homogeneous convex function 
A*(|/c|) on the vector space E\ spanned by dom A*. By the analytic form of Hahn-Banach 
theorem, there exists an extension uo of uo to E% which satisfies u(k) < A*(\k\) for all 
k G Ei. But Ei = Xi and one completes the proof of the lemma with ()3.32|) and Lemma 

Enijb. □ 

3.5. Optimal plan: preliminary results for the necessary condition. Under the 
condition (|3.2H a). n necessarily satisfies: supp n C S. This fact will be invoked without 
warning. 

Lemma 3.33. Let n and r\ satisfy \3.21V a.b). Then, the restriction of r] to L^.n : = 
{h.7i; h G -^oo(^4 x B, 7r)} is given by 

(V,h.7i)= I hcdn, V/i G L^n) (3.34) 

J AxB 

Proof. To specify the restriction 7 of rj to L^.tc, it is enough to vary $^ in the direction 
Loo. 77 to get with ()3.22j) : 7 G d( Loc . n )*§l(7i). Taking h G L 00 (7r) such that ||/i||oo < 1, by 
monotone convergence we obtain + h.ir) = sup{J AxB (l + h)udn;u G C c ,u < c} = 
f s (l + h)cdj[. It comes out that d^L^^y^iji) = {c}, which gives (|3.34j) . □ 

We first derive the necessary condition in the special case where c is assumed to be 
finite and continuous. 

Proposition 3.35. Assume that c is finite and continuous and let tt be an optimal plan. 
Then, there exist two finitely-valued upper semicontinuous functions ip on A and ip on B 
such that 

<p © ip < c everywhere and 
<p @ip = c on supp TT. 

Proof. At the beginning of this proof, c is only assumed to be finite and lower semicontin- 
uous. By Lemma 13.191 (|3.17|) is satisfied. Let r\ and uj be as in (|3.21l -b & c). Because of 
Theorem 12. 131 -c-l & 2, there exists a generalized sequence {(a T ,/3 r )} in domAi such that 
lim r T 1 *(a T ,/3 T ) = 77 with respect to a(C' c *,C' c ). As T*domAi C U\ (see Lemma Pig). 
T*(a T , (3 T ) = a T /3 r G C c and 

(a) lim T a T © (3 T = 77 with , , 

(6) C c 3 a T © P T < c for all r [6 ' 6b) 

Defining 

7j(a,b) = ( V ,e (atb) ), {a,b)eAxB, (3.37) 
where e( Q) &) is the Dirac mass at (a, b), one immediately sees that 
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Furthermore, since rj = lim r a T © (3 T pointwise (C' c contains the Dirac masses), by (0, 
Corollary 3.5) we obtain that rj — (p © ip for some functions (p and ip on A and B. This 
gives us some hope to complete the proof, but as will be seen below, rj isn't the right 
function to be considered. 

For any (a, b) in supp7r : the support of 7r, one can find a sequence {hk} in C c such that 
limfc hk.n = e( Qi fc) in V c , see Lemma 13.601 below. As c is lower semicontinuous, with ()3.34|) 
we obtain 

liminf (h^.n, r]) = liminf / h^cdrr > c(a,b). (3.39) 

k ' k J AxB 

Unfortunately, no regularity property for rj has been established to insure that (r], €( a ,b)) > 
liminffc(?7, hk.n); this would lead to the converse of (j3.38|) : rj > c on supp7r. An alternate 
strategy is to introduce the upper semicontinuous regularization 

rj = us rj 

of rj on AxB. As rj is upper semicontinuous, for all (a, b) G supp7r, we have f](a,b) > 
limsnp k J AxB 7 f]hk dn . Now, one obtains with ()3.39|) that 

fj(a,b) > c(a,b), V(a, b) G supp7r. (3.40) 

Regularizing both sides of ()3.38|) and assuming that c is upper semicontinuous and there- 
fore continuous, we obtain that 

r/ < c. (3.41) 

It remains to check that 

r\ = ip © ijj 

for some finitely- valued upper semicontinuous functions ip and ip on A and B. With (j3.16J) 
and ()3.2H) we know that r\ = oja ®ojb for some uj G X*. It follows that rj = uja®^b where 
uja{o) = uja^o) and £5b(6) = UB^b)- With Lemma 13.251 one sees that rj = usuja © usa;^. 
This proves the desired result with p = usuja and ip — usa;^. Since rj <rj < c and both 
rj and c are finitely- valued, so are <p and ip. □ 

Remark 3.42. By means of the usual approaches 0EEI]> one can prove when c is finitely- 
valued that under the assumptions (|1.6J) or (jl.7|) . <p and ip can be required to be c-concave 
conjugates to each other. In the special case where c is assumed to be continuous, c- 
concave conjugates are upper semicontinuous. This is in accordance with Proposition 

Now, we remove the assumption that c is finite and continuous and only assume that 
it is lower semicontinuous. The main technical result for the proof of the characterization 
of the optimal plans is the following 

Lemma 3.43. Assume that c is a [1, oo]-valued lower semicontinuous function. Let n G 
V c and rj G C* be as in \3.21Y b). i.e. n G dc' c ^2{f]) an d define the function rj on S by 

rj(a,b) = (r/,e (a , 6) ), (a, b) G S. (3.44) 

Then, 

j rj < c on S 

\ rj = c on supp tt f]{c < oo} 
and rj is a finitely-valued measurable function on S. 

Remark 3.45. We assume that c > 1 without loss of generality, see Remark 13.81 to allow 
dividing by c in the definition of C c . 
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Proof. Because of Theorem 12 . 13I -C-2 . there exists a sequence {p n } in Cab such that 

Pn < c, Vn and , , 

limn-** p n = rj 

with respect to c(C'*, C' c ). Having Remark |3.9l in mind, recall that only the restriction of 
p n to S carries information as regards to the dual pairing (C r *,C' c ). Also recall that the 
k th Moreau- Yosida approximation of a function u on a space with metric d is defined for 
all x by u( k \x) = ud y {u(y) + kd(x,y)}. Defining the Moreau- Yosida approximations 

p njk = [max(p n , k)] {k) 
c k = [max(c, fc)] (fc) 



for all n, k > 1, (13.46)) implies that 

(a) p n)k , c k G C AB , Pn,k < c k < c, \/n > 1 

(b) lim n lim fc (p n , k , m) = (rj,m), Vm G M c ,m > (3.47) 

(c) < Cfc t c pointwise. 

where A1 c is the space of all measures m on S such that J s cd\m\ < oo. By Remark 13.91 
one sees that M. c C C£. 

While deriving (|3.47|) . we used the well-known results: 

• a Moreau- Yosida approximation is a continuous function and 

• the sequence of Moreau- Yosida approximations of a function tends pointwise and 
croissantly to its lower semicontinuous regularization. 

The proof of statement (|3.47l -b) relies on the monotone convergence theorem; this is the 
reason why it holds for all m in Ai c rather than in C' c . 

Let us introduce the cone Q + = {£ G C' c ; £ > and (rj, £) > 0} and Q the vector space 
spanned by Q + . We first consider the restriction 9 of rj to Q. By (13.34)1 . 7r is in Q + and 
gives us 6 G d Q *N(ir) where N(£) = sup{(u, £);ue C c , \u\ <c},£ G Q which is the 
dual norm || • ||* restricted to Q. It follows that 9 belongs to the topological dual space Q' 
of the normed space (Q, \\ ■ ||*) : 

■= v\q e Q'. 

This topological regularity of 9 will allow us a few lines below to invoke Br0nsted- 
Rockafellar's lemma. It is not clear that rj is continuous on the whole normed space 

(■:■ 

Let us denote \l/ the restriction of $^ to Q and ty* its convex conjugate with respect to the 
dual pairing (Q, Q). Since 9 G Q', 9 > and rj G T, one sees that < V*(6) < $ 2 (rj) = 0. 
As ()3.21l -b) is equivalent to Young's identity ()3.23j) . one obtains 

*(7r) + **(0) = (0,7r>= lim(a,vr) 

where 

in = Pn,k(n)\Q G Q' (3-48) 

is the restriction of p n ,k{n) ^ Cc* to Q C for some sequence {k{n)} n which converges 
fast enough to infinity to imply that lim n ^ 00 (^ n , n) = (9, n) by means of (|3.47l -b). 

Denote \l/ n the restriction to Q of the analogue of §l with Ck( n ) instead of c and ^* 
its convex conjugate with respect to (Q, Q'). By (13.471 c). we have lim^oo ^ n {ii) = ty(ir). 
By dS33-a), we also have #*(0) = **(Cn) = for all n. Therefore, 
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with lim^oo e n = 0. In other words, £ n is an e n -subgradient of ^ n at ir. Hence, by 
Br0nsted-Rockafellar lemma, there exist two sequences {vr n } in Q and {0 n } in Q' such 
that for all n, 

lk„-7r|| < v^, (3-49) 

||en-M < (3-50) 

(both norms iV(£) = sup{ (u, f);u£ C c , |u| < c} on Q and sup{(-, Q,N(£) < 1} on 

Q' are simply written || • ||) and 

6 n ed Q ,V n (n n ). (3.51) 

We define 

G = {(a,b)eS; (v,e {a , b) )>0} 

the set of all (a, b) G AxB such that e( a) &) G Q. Since c&( n ) is finite and continuous, 
proceeding as in Proposition 13.351 one shows as for ()3.4U|) that 

9 n (a,b) = Cjt( ra )(a,6), V(a, b) G cl G fl supp 7r n (3.52) 

where 

n (a, 6) = us 9 n (a, b), (a, 6) G S 
is the upper semicontinuous regularization of 

a (n h \ - S 9 n(t(a,b)) if (a, b) G G , . 
^ (G ' 6) - \ -oo otherwise ' ( a > o) G o 

and clG is the closure of G in Ax 5. Since 5 is closed, we have c\G <Z S. As 7r n may not 
be a measure, one uses Lemma 13.601 below instead of its usual analogue. 

Thanks to (|3.49|) . lim^oo 7r„ = tt strongly in Q and for all large enough n we have 
7r n G <2 + and 

SUpp 7T n = SUpp 7T. (3.53) 

With rj the upper semicontinuous regularization of rj, we have {rj > 0} = cl {?7 > 0}. It 
follows from (ET47ID that 

supp7rCclG. (3.54) 
Thanks to (jHHD and (j33UP , for all < r < oo, 

lim sup |/5 n ,fe(n)(a, 6) - w (a, b) | = 0. (3.55) 

n ^°° (a,b)€Gn{c<r} 

Let us assume for a while that 

sup c < oo. (3.56) 
s 

Under this assumption, ()3.55|) leads us to lirn„_ i . 00 sup G \p n ,k(n) ~ @n\ — on G. Upper 
regularizing, because of this uniform estimate and the continuity of p n Mn)i one obtains 

lim 7 n = (3.57) 

where 7„ = sup clG \p n ,k(n) ~ 9n\- By (I3.52J1 . f)3.53|) . (|3.54|) and p nk < p n for all n,k, we 
obtain for all large enough n : p n (a, b) > Cfc( n )(a, b) — j n , V(a, b) G supp n. Letting n tend to 
infinity, we see with ()3.46|1 . ()3.47l -c) and ()3.57j) that rj(a, b) = c(a, b), for all (a, b) G supp tc 
where rj is defined at ()3.44j) . We have just proved that under the assumption ()3.56|) . 

(a) 77(0,6) < c(a,6), V(a, b) G <S , g 5g , 

(6) r/(a,b) = c(a,6), V(a, 6) G supp 7r ^ ' ' 

where the statement (a) directly follows from (|3.46|) . 
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It remains to remove the restriction ([3.56)1 . For each k > 1, let 

<Sk = {c < k} and 

c fc = c + 

The function is lower semicontinuous on AxB and satisfies f|3.56|) : {Sk} is a croissant 
sequence of closed level sets of c with Sk C S for all k. By Proposition 13.121 -b we have 

SUpp 7T C 5. 

It is assumed that 7r 6 <9cv$ 2 (f?) which is equivalent to the Young's identity &1(ti~) + 
^2(^7) = (77, 7r) or equivalently [$i(vr) = (rj, ir) and $2(7?) = 0] which is also equivalent to 

(v^) = f AxB cdn and 

(r),£) <®l(£), WeC' c ^ Dyj 
because of Proposition 13 . 1 2i d and Lemma [3.611 below. Let us consider for each k 

( %) £) = (77, £), W G C£ such that supp £ C S k 

Note with Remark 1331 that 7r fe G C' and % G C* . Also introduce 0^ and Q* k the analogues 
of Q\ and $2 where c is replaced by c^. 

By Proposition 13.121 -d and (|3.34j) . Qki^k) = J AxB Ckd7ik = tt^}. Since for any £ E C' c 
such that supp£ C <Sfc we have Qk(£) = &l(£), one obtains with ()3.59|) that (r]k,£) < 0fc(^) 
for all £ G C' . Reasoning as for the derivation of (|3.59|) but taking the reverse way, this 
shows that 

7T fe G d C ' Ck ®* k (r)k)- 

Applying ()3.58|) yields 

rjk(a, b) < c k (a, b), V(a, 6) G S k \fk > 1 

rj k {a,b) = c k {a,b), V(a, 5) G supp 7r fc ' - 

with rjk(a, b) = {rjk, e( ,&)), (a, &) G Sk- As U&«Sfc = {c < 00}, this is equivalent to 

rj(a, b) < c(a, b), V(a, b) G {c < 00} 
?7(a, 6) = c(a, 6), V(a, 6) G supp 7r f]{c < 00} 

Finally, one sees with (|3.46|) that rj = lim^oo p n on S. This implies that rj is measurable 
on S and completes the proof of the lemma. □ 

During this proof, we have used the following elementary lemmas. 

Lemma 3.60. Let £ be a positif element of C' c . For any (a,b) G supp£, there exists a se- 
quence {hk}k>i of positif continuous bounded functions on AxB such that lim^oo hk-£ = 
£( a ,b) in with respect to a(C' c ,C c ). 

Proof. To see this, consider a decroissant sequence {Gk}k>i of neighbourhoods of (a, b) 
with linifcGfc = {(a, b)} and choose hk such that {hk > 0} C Gk and (hk,£) = 1, this is 
possible since A x B is a metric space. □ 

Lemma 3.61. For any rj G C'*, the three following statements are equivalent: 

(1) $2(77) = 0; 
(ii) rj G T; 

(Hi) (V,£) < /or a// £ G 
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Proof. The equivalence (i) 4=> (ii) is an immediate consequence of (|3.15jl . 

Let us prove: (ii) <^ (iii). Taking the closure, it is enough to check that for all u in Cab 

u e To [(u,£) <^(£), We C' c \. (3.62) 

Young's inequality (u,£) < $(w) + $*(£) and u G T <^> = for all u,£ give the 

direct implication. For the converse, choosing £ = £( a ,b) m the right-hand side of (|3.62p . 
one obtains for all (a, b) G S, u(a,b) < <&*(e( 0) &)). But, $*(£( a ,&)) = c(a, 6) by Proposition 
13.121 -d. This proves (J3.62)) and completes the proof of the lemma. □ 

3.6. Optimal plan : completing the proofs of Theorem 13.31 and 13. 5L We are now 

in position to complete the proofs of these results. 

Proof of Theorem \S.Si • Proof of (1). Sufficient condition. Let 7r G P(fi, v) be such that 
f AxB cdn < oo. Let <p and i/j satisfy (|3.27|1 . Because of Lemma 13.261 one obtains that 
Is 99 and Is ^ still satisfy ()3.27|) as well as Is y G Li(A, fi) and \sip G L\(B, v). Thanks to 
Lemma 13.311 there exists some uj G T (see ([3.28)) ) such that (a?, zv)) = Jaxb ^ 77 - But, 
with Lemma 13.291 -c: T 2 *uj G T. Therefore, one can apply Theorem 13. 241 - b which insures 
that 7r is optimal. 

Necessary condition. Let tc be an optimal plan. Because of Theorem 13.241 -b there exists 
uj G X* such that 7] := T^uj G V and ()3.2H -b) holds. With Lemma 13.43) one sees that rj 
defined by (|3.44|) satisfies rj < c on S and rj = c on supp n f]{c < oo}. By (I3.16|) . for all 
(a, b) G S we have rj(a,b) = uja(o>) + uJs(b) where uja(o>) = (^>A,^a) and uJs(b) = (u)B,€b)- 
One concludes the proof, taking p> = 1s a uja and ip — 1<s s ^b where Sa and Sb are the 
canonical projections of S on A and 5. 

• Proof of (2). It appears from Lemmas 13.311 and 13.431 that the optimal functions (<p,ip) 
and the optimal linear form uj G X* associated with 7r by the KKT condition ([3.21)1 . see 
Theorem IA.81 are related to each other by 

oo(e a , €b) = p> © 4>(a, b), for 7r-a.e. (a,b)eAxB. (3.63) 

Therefore, (|3.21|) and (|3.4|) express the same KKT condition. If ip and ip are measurable, 
then they are integrable by Lemma 13. 261 -1 and they solve (K) by Theorem 13.241 This 
proves statement (a). In the general situation (b), replacing (ip,ip) by (Is <p, Is ip) , one 
concludes similarly by means of Lemma [3.261 -2. □ 

Proof of Theorem \S.b\ By Theorem 13.31 there exist functions tpi and ipi satisfying (J3.27)) . 
By Lemma r3.26l there exist functions pi and ip2 such that P2 G Li(A, fj,) and ip2 G Li(B, v). 
Now with Lemma (3.311 one can extend (y^,?/^) i n the sense of (13.63)1 into uj G X* such 
that uj(k) < A*(\k\), Wk G X\. But, this is clearly equivalent to |o;(k)| < A*(|k|), Vk G X\. 
Applying Lemma 13.431 and taking ip = 1s a u>a and ip = 1s b ^b as in the proof of the 
necessary condition of Theorem 13.31 leads to the desired result. □ 

4. The proofs of the results of Section [2] 

The results of Section |21 are a summing up of Proposition 14. 7) Lemma ^.lll . Proposition 
14.12) Corollarv r4.171 Lemma l4~lT?l Proposition ^. 201 Proposition l4.3Ul and Proposition ^. 381 

We are going to apply the general results of the Lagrangian approach to the minimiza- 
tion problem (P) which are recalled at Appendix [X] We use the notations of Appendix 

HI 
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4.1. Preliminary technical results. Recall that = infja > 0; <&±{u/a) < 1} with 
&±( u ) — max($(ti), $(—«)). Its associated dual uniform norm is 

\£\%= sup \(u,£)\, led 

u,\u\<§<1 

on L\. The topological dual space of (£%, \ ■ |J) is denoted by £[. It is the topological 
bidual space of (Ui, \ ■ |$). 

Similarly, recall that \y\ A = mi{a > 0;A±(y/a) < 1} with A±(y) = max(A(j/), A(— y)). 
Its associated dual uniform norm is 

\x\* A — sup \(y,x)\, x G X\ 
s/ 5 |j/|a<i 

on X\. The topological dual space of (Xi, \ ■ | A ) is denoted by A"/. It is the topological 
bidual space of (3^i, | • |a). 

The adjoint operator T-f which appears at Lemma 14.11 -f below is defined as follows. For 

all u G X[ and all £ G d, \(T}u,£) ChCl \ = \(u,T£) xhXl \ 

Lemma 4.1. Let us assume (H$) and {Ht). 

(a) dom$* C L\ and domA* C X 1 

(b) T(dom$*) C domA* and TC X C X x 

(c) T is a (C,U) -a (X ,y)- continuous 

(d) T 2 * : X* — > C\ is o (Xf, X x ) -o (C[, Ci)- continuous 

(e) T\ : L\ —>■ X\ is \ ■ ■ \* A - continuous 

(f) TlX(c£[ 

(g) T*yi C IA\ and T x * : — > U\ is cr(^i, Xi)-a(Ui, Ci) - continuous 

(h) Ti : Ci —* Xi is a (Ci,Ui) -a (Xx,yi)- continuous 

Proof. • Proof of (a). For all £ G £ and a > 0, Young's inequality yields: (u,£) = 
a{£,u/a) < [<&(u/a) + $*(£)]a, for all «6W. Hence, for any a > < [l + $*(£)]<x 

It follows that (w, < [1 + $*(£)] |w|<j>. Considering — u instead of u, one gets 

\(u,£)\ < [l + $*(£)]\u\*,VueU,£e C. (4.2) 

It follows that dom$* C C\. One proves domA* C X\ similarly. 

• Proof of (b). Let us consider | • |$* and | • | A * the gauge functional of the level sets 
{<3>± < 1} and {A^ < 1}. It is easy to show that 

A* ± (x) < for all£ G £ and x G X such that T£ = x (4.3) 

Therefore, T(dom$! b ) C domA^. On the other hand, by Proposition IB. II fsee the Ap- 
pendix), the linear space spanned by dom*^^ is dom | • |$« and the linear space spanned 
by dom A^ is dom | • | A * . But, dom | • |$* = dom | • | J = L\ and dom | • | A *_ = dom | • ] A = X\ 
by Proposition IB. II again. Hence, TC\ C X\. 

• Proof of (c). To prove that T is continuous, one has to show that for any y G y, 
t G C i— > (y, T£) G 1 is continuous. We get £ i— ► (y, T£) = (T*y, £) which is continuous 
since {H T1 ) is T*y G U. 

• Proof of (d). It is a direct consequence of T L\ C X\. See the proof of (c). 



• Proof of (e). We know by Proposition IB.ll that | • |$* ~ | • || and | • | A * ~ | • | A 
are equivalent norms on C\ and X\ respectively. For all £ G C\, \T£\* A < 2\T£\ A * ± = 
2inf{a > 0;A* ± (T£/a) < 1} < 2 inf {a > 0;®* ± (£/a) < 1}. This last inequality follows 
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from ([4.3)1 . Going on, we get \T£\* A < 2\£\&* < 4:\£\%, which proves that T\ shares the 
desired continuity property with ||Ti|| < 4. 

• Proof of (f). Let us take u E X[. For all £ E d, \ (T}u, l) c ^ Cl \ = \{u,T£) x{>Xl \ 
< \\u>\\ X '\T£\ A < ||c<j||^/||Ti|| \£\$ where ||Ti|| < oo, thanks to (e). Hence, T-fu; stands in £[. 

• Proof of (g). Let us take y E y%. We've just seen that Tfy stands in C[. Let us show that 
in addition, it is the strong limit of a sequence in IA. Indeed, there exists a sequence (y n ) 
in y such that ]im Tlr ^ 0O y n = y in (y 1 , \ ■ | A ). Hence, for all £ E £ u \(T*y n -T*y,£)c*,c x \ = 
\(y n -y,T£)y ltXl \ < \\Ti\\\y n -y\A\£\% and sup^^*^ \{T£y n - T?y,£)\ < \\T l \\\y n -y\ A 
tends to as n tends to infinity, where T?y n = T*y n belongs to U for all n > 1 by (Hti)- 
Consequently, T*y E U\. 

The continuity statement now follows from (d). 

• Proof of (h). 

By (b), Ti maps L\ into X\ and because of (g): T*3^i C U\. Hence, for all y E y±, 
£ I— > (Ti£,y)x u y 1 = (£,T*y)c u U! is cr(£i, Wi)-continuous. This completes the proof of 
Lemma 14.11 □ 

Let $q, Aq and be the convex conjugates of $ , A and Ai for the dual pairings 

(u,c),(y,x)asid(y 1 ,x 1 ). 

Lemma 4.4. Under the hypotheses (H$) and (Ht), we have 

(a) $ = $! < $ on U (a 7 ) A = A 1 < A on y 

(b) $* = % on C (b ') A* < Aq on X 

(c) $* = $* = $* on & (c') A* < A* < A* on X x 

Proof, (a) follows directly from Lemma [4.11 -a. (a') from (a) and (b') from (a'). 

(b) follows from the general fact that the convex conjugates of a function and its convex 

lower semicontinuous regularization match. 

Let us show (c). As U is a dense subspace of U\, we obtain that the restriction of $* to 
C\ is also the convex conjugate of $ (restricted to C\) for the dual pairing (Ui, Ci). Now, 
with the same argument as in (b), this implies that $* = $J on C\. 
(c') follows from (a'), the fact that y is a dense subset of 3^i, the weak continuity of T* 
which is proved at Lemma [4.11 -g and the lower semicontinuity of $i. □ 

Lemma 4.5. Under the hypothesis (H$), 

(a) $* = $q is a(C,U)-inf-corapact and 

(b) $* is a (Ci,Ui)-inf- compact. 

Proof. • Proof of (b). We first prove that $J is a (£i, Wi)-inf-compact. Recall that we 
already obtained at (|OD that \(u,£)\ < [1 + $*(£)] \u\$, for all u E U and £ E C. By 
completion, one deduces that for all £ E L\ and u E U%, \(u,£)\ < [1 + (recall 
that $* = $* on Ci, LemmalOl-c.) Hence, $*(£) < A implies that \£\% < A+l. Therefore, 
the level set {$* < A} is relatively a(£\, Wi)-compact. 

By construction, $* is <t{C\, Wi)-lower semicontinuous. Hence, {$* < A} is cr(£i, U\)- 
closed and o~(£i, Wi)-compact. 

• Proof of (a). As $* = $q = $* on C\ fLemmaOc), dom$* C L\ (Lemma OJ-a) and 
U C Ui, it follows from the cr(£i, Wi)-inf-compactness of $^ that $* = $g is cr(£,W)-inf- 
compact. □ 
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4.2. A first dual equality. In this section we only consider the basic spaces L(, £, y and 
X. Let us begin applying Appendix 1X1 with (P, A) = (U, £) and (B, Q) = (y, X) and the 
topologies are the weak topologies a(£,U), a(U,£), a(X,y) and o~(y, X). The function 

f if x G C 

to be minimized is f(£) = &*(£) + 5c(T£), i G £ where Sc(x) = < . f , n denotes 

I ~rOO it X G O 

the convex indicator of C. The perturbation F of / is Fenchel's one: 

F {1, x) = $*(£) + 5 C (T£ + x),£e£,xeX. 
We assume (Hti): T*y C W, so that the duality diagram is 

u , c 



T (Diagram 0) 



y , x 

The analogue of F for the dual problem is 

G Q {y, u) ± inf {{y, x) - (u, £) + F (£, x)} = mf{y, x) - $ (T*y + u). 

£,x xGC 

The corresponding value functions are 

(p (x) = inf{$*(£);£G £:T£eC -x}, xeX 

7o(u) = sup{inf (y,ar) - <£> (T*y + u)}, uEli. 
yey x ^ c 

The primal and dual problems are (P) and (Do). 

Lemma 4.6. Assuming (H$) and (Hti), if C is a cr(X, y)-closed convex set, F is jointly 
closed convex on £ x X. 

Proof. As T is linear continuous (Lemma l4.1l -c) and C is closed convex, {(£, x); T£+x G C} 
is closed convex in £ x X. As $* is closed convex on £, its epigraph is closed convex in 
£ x R. It follows that epi F = (X x epi <£>*) fl {(£, x); T£ + x G C} is closed convex, which 
implies that F is convex and lower semicontinuous. As it is nowhere equal to — oo (since 
inf Fq > inf $* > — oo, Fq is also a closed convex function. □ 

Therefore, assuming that C is a a(X ,y)-closed convex set, one can apply the general 
theory of Appendix El since the perturbation function Fq satisfies the assumptions (jA.l|) 
and (TOl). 



Proposition 4.7. Let us assume that (H$) and (Ht) hold. If C is convex and o~(X,y)- 
closed, we have the dual equality 

inf(P) = sup(£> ) e [0, oo]. (4.8) 

In particular, for all x in X , we have the little dual equality 

inf{$*(£); £ G C, T£ = x} = A* (x) G [0, oo]. (4.9) 

Proof. The identity (|4.9j) is a special case of (|4.8|) with C = {x}. 

To prove (|4.8|) . we consider separately the cases where inf(P) < +oo and inf(P) = +oo. 

Case where inf(P) < +oo. Thanks to Theorem IA.6l -b'. it is enough to prove that 70 is 
upper semicontinuous at u — 0. We are going to prove that 70 is continuous at u — 0. 
Indeed, for all u EU, 

- 7o (m) = mf{$o(T*y + u)- mUy,x)} < %(u) < $(u) 

y xeC 
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where the first inequality is obtained taking y = 0. The norm | • |$ is designed so that 
$o is bounded above on a | • ^-neighbourhood of zero. By the previous inequality, so 
is the convex function —70. Therefore, —70 is | • (^-continuous on icordom (— 70) 3 0. As 
it is convex and L\ = (U, \ ■ |$)', it is also u(U, £i)-lower semicontinuous and a fortiori 
cr(V(, £)-lower semicontinuous , since C\ C C 

Case where inf(P) = +00. Note that sup(Po) > —$0(0) = > —00, so that we can apply 
Theorem IA.6l b. It is enough to prove that 

ls<^ (0) = +00 

in the situation where y2 (0) = inf (P) = +00. We have lsy9 (0) = supy eA ^ ) inf{$Q(£); t : 
T£ G C + V} where jV(0) is the set of all the a(X, 3^-open neighbourhoods of G X. It 
follows that for all V G W(0), there exists i G C such that Tt G C+V and %{£) < \s<p {0). 
This implies that 

T({$* < Is v5 (0)}) n (C + V) ± 0, VV G Af(0). (4.10) 

On the other hand, inf(P) = +00 is equivalent to: T(dom$Q) H C = 0. 

Now, we prove ad absurdum that ls<^o(0) = +00. Suppose that ls^o(0) < +00. Because 

of T(dom$o) H C = 0, we have a fortiori 

T({$S<ls^ o (O)})nC7 = 0. 

As <£>q is inf-compact fLemma 14.51 -a) and T is weakly continuous ( Lemma 14. ll -c). T({&q < 
ls<^o(0)}) is a a(X, y)-compact subset of X. Clearly, it is also convex. But C is assumed 
to be closed and convex, so that by Hahn-Banach theorem, C and T({§q < ls<^ (0)}) 
are strictly separated. This contradicts ()4.1U|) . considering open neighbourhoods V of the 
origin in (j4.10J) which are open half-spaces. Consequently, h(p (0) = +00. This completes 
the proof of the proposition. □ 

4.3. Primal attainment and dual equality. We are going to consider the following 
duality diagram, see Section 12731 



T{\ T- L (Diagram 1) 



Note that the inclusions T\C\ C X\ and T*y% C U\ which are stated in Lemma 14.11 are 
necessary to validate this diagram. 

Let F\,G\ and 71 be the analogous functions to F Q , Go and 70. Denoting <pi the primal 
value function, we obtain 

Ft(£,x) = m£)+5 Cl (T 1 £ + x),£eC 1 , x G X x 
Gi(y,u) = inf (y,x) - $i(T*y + u), yey^ueUx and 

iptix) = inf{$*(£); £eC 1 :T 1 £eC 1 - x}, x G X t , 
7i(m) = sup{inf (y,x) -$i(T*y + u)}, u G U\. 

It appears that the primal and dual problems are (Pi) and (Pi). 

Lemma 4.11. Assuming (H$) and {Ht), the problems (P) and (Pi) are equivalent: they 
have the same solutions and inf (P) = inf (Pi) G [0, 00]. 
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Proof. It is a direct consequence of dom$* C £1, TC\ C X x and $* = $* on C\, see 
Lemma f4.H -a.b and Lemma [4.41 -c. □ 

Proposition 4.12 (Primal attainment and dual equality). Assume that (H®) and (Pt) 
hold. 

(a) For all x in X\, we have the little dual equality 

inf {$*(£); t G C,T£ = x} = AJ(ar) G [0,oo]. (4.13) 
Assume that in addition (H c ) holds. 

(b) We have the dual equalities 

inf(P) = sup(P>i) G [0, oo] (4.14) 
inf(P) = inf(Pi) = inf A*(rr) G [0, oo] (4.15) 

(cj //in addition inf(P) < oo, then (P) is attained in C\. 

(d) Let I G C\ be a solution to (P), then x — T£ is a solution to (Pi, a 1 ) and inf(P) = 
$*(£) = A*(x). 

Proof. • We begin with the proof of ()4.14|) . As, inf (P) = inf (Pi) by Lemma ^.lll we have 
to show that inf(Pi) = sup(Pi). We consider separately the cases where inf(Pi) < +oo 
and inf (Pi) = +oo. 

Case where inf (Pi) < +oo. Because of (He), Pi is jointly convex and F\(£, •) is o~(Xi, 3^i)- 
closed convex for all I G L\. As T*y^ C Wl fLemma 14. 1|) . one can apply the approach of 
Appendix El to the duality Diagram 1. Therefore, by Theorem IA.6l b'. the dual equality 
holds if 71 is a(Ui, £i)-upper semicontinuous at 0. As in the proof of Proposition 14.71 
we have —71(11) < $i(u), for all u G U\. But $1 is the a(Ui, £i)-lower semicontinuous 

regularization of $(tt) = < ^ u ^ ^ , u E IA\ and $ is bounded above by 1 on 

v ' I +00 otherwise 

the ball {u G < 1}. As £1 = (ZYi, | • |$)', $1 is also the | • |$-regularization of $. 

Therefore, $1 is bounded above by 1 on {u G ZYi; < 1}, since {w G < 1} is 

I • |$-dense in {u G ZYi; |tt|$ < 1}. As — 7i(< $1) is convex and bounded above on a | • |$- 
neighbourhood of 0, it is | • (^.-continuous on icordom (—71) 3 0. Hence, it is o~(Ui, £i)-lower 
semicontinuous at 0. 

Case where inf(P x ) = +00. This proof is a transcription of the second part of the proof 
of Proposition 14.71 replacing T by Ti, C by Ci, all the subscripts by 1 and using the 
preliminary results: is inf-compact ( Lemma I4.5JI and Ti is weakly continuous (Lemma 
!4.H -h). This completes the proof of (J4.14j) . 

• The identity (ETTHj) is simply (ETTl) with C x = {x}. 

• Let us prove (c). By Lemma l4~TT -h. 7\ is a(C\, U\)-<j(X\, 3^i)-continuous. Since C\ is 
a(Xi, 3^i)-closed, {£ G L\\T£ G Ci} is er(£i, Wi)-closed. As $* is a(C\, Wi)-inf-compact 
( Lemma 14. 5|) . it achieves its infimum on the closed set {£ G C%;T£ G Ci} if inf (Pi) = 
inf(P) < 00. 

• Let us prove (|4.15j) . The dual equality (|4.14|) gives us, for all x G Ci, inf (Pi) = 
su Pj/eyi{ inf ^GCi(y,a;) - Ai(y)} < sup yeyi {(x ,y) - Ai(y)} = A*(x ). Therefore 

inf(Pi) < inf A*(x). (4.16) 

In particular, equality holds instead of inequality if inf (Pi) = +00. Suppose now that 
inf (Pi) < 00. From statement (c), we already know that there exists £ G £1 such that 
x = T£eC 1 and inf(Pi) = $*(!). Clearly inf(Pi) < inf {§*(£) ;T£ = x,£ G £1} < $*(!). 
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Hence, inf(Pi) = inf{$J(f);T£ = x,£ E d}. By the little dual equality (PLT3J) we have 
inf{$*(£);T£ = x,£ G C\] = A*(x). Finally, we have obtained inf (Pi) = A*(x) with 
x G C\. Together with (j4.16|) . this leads us to the desired identity: inf (Pi) = inf^d A\(x). 
• Finally, (d) is a by-product of the proof of (|4.15j) . □ 

The following result is an improvement of Lemma 14.41 -c'. 

Corollary 4.17. We have domA^ C domAg, domA^ C X\ and in restriction to X\, 
A* — A* 

Proof. The first part is already proved at Lemma 14.11 -a. The matching Aq = A^ follows 
from flOJ) and (PLT31) . □ 

4.4. Dual attainment. We now consider the following duality diagram 



X\ , X{ 



T 2 * (Diagram 2) 



where the topologies are the respective weak topologies. The associated perturbation 
functions are 

F 2 (£,x) = $l(e) + 6 Cl (T£ + x), £e£i,xeX 1 
G 2 ((,u) = inf (x,u)-$ 2 (t;u + 0, (e£l,LueX* 

As F 2 = Fi, the primal problem is (Pi) and its value function is (pi : 

iptix) = inf A*(x'), x G X x (4.18) 

where we used ()4.13|) . The dual problem is (D 2 ). 

Assume that inf(P) < 00. We know by Proposition 14.121 -d that (Pi,*) admits at least 
a solution x = Tl where £ is a solution to (Pi). Let us consider the following new 
minimization problem 

minimize $*(£) subject to T£ = x, £ G L\ (Pf) 

Of course £ is a solution to (Pi) if and only if it is a solution to (Pf) where x = T£. 
Since our aim is to derive a representation formula for £, it is enough to build our duality 
schema upon (Pf) rather than upon (Pi). The associated perturbation functions are 

Fi(£,x) = $t(£) +5 {s} (T£ + x), £e£ 1 ,xeX 1 

G s 2 (C,u) = (x,u)-$ 2 (t;u + 0, (eci,uex* 

As F£ is Pi with C\ = {x}, the primal problem is (Pf ) and its value function is 

ipf(x) = A*(x - x), x G X x . 

The dual problem is 

maximize (x,u) — A 2 (u), uo G X* {D 2 ) 

Lemma 4.19. Under the hypotheses (H$) and (Ht), A\ is o~(Xi,yi) -inf- compact. 

Proof. By (JUS): inf {<!>*(£); £ G L X ,T X £ = x} = A&x) for all x G X x (note that $* = §\ 
on L\ by Lemma [Ol d As T\ is continuous (Lemma l4.H -h) and $* is inf-compact (Lemma 
14.5)) . it follows that A^ is also inf-compact. □ 

Proposition 4.20 (Dual attainment). Assume that (H$), (H T ) and (He) hold. 
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(a) Suppose that 

CnicordomA* ^ 0. (4.21) 
Then the dual problem (D 2 ) is attained in Xf. 

(b) Suppose that C R dom ^ 0. Then, inf(P) < oo and we know (see Proposition 1^.12 - 
d) that (Pi,x) admits at least a solution. If in addition, there exists a solution x to 
(Pi,x) suc h that 

x G diffdomA*, (4.22) 
then the dual problem (D%) is attained in X{. 

Proof. • Proof of (a). As F 2 = one can apply the approach of Appendix El to the 
duality Diagram 2. Let us denote tp\* the cr(Xi, J\)-lower semicontinuous regularization 
of <fi and <p 2 * its <j(X\, X*)-lower semicontinuous regularization. Since X\ separates 3^i, 
the inclusion y x C X* holds. It follows that (p?(0) < y? 2 *(0) < ¥>i(0). But we nave dHH 
which is <pl*(0) = <£i(0). Therefore, one also obtains (f 2 *(0) = <^i(0) which is the dual 
equality 

inf(P!) = sup(D 2 ) (4.23) 
and one can apply Theorem IA.6I -C which gives 

argmax(£> 2 ) = -fy>i(0). (4.24) 

It remains to show that the value function <pi given at (j4.18|) is such that 

3^(0)^0. (4.25) 

As the considered dual pairing (Xi,X*) is the saturated algebraic pairing, for ()4.25|) 
to be satisfied, by the geometric version of Hahn-Banach theorem, it is enough that 
G icordom^i. But this holds provided that the constraint qualification (J4.21)) is satisfied. 

• Proof of (b). Let us specialize to the special case where C\ = {x}. The dual equality 
()4.23j) becomes 

inf(Pf) = sup(Df) (4.26) 
and ()4.25|) becomes d(p*(0) ^ which is directly implied by ()4.22|) . □ 

Remark 4.27. The dual equality (|4.2(jl) is 

A* = A; (4.28) 

where these convex conjugates are to be taken respectively with respect to (Xi,yi) and 
(Xi, X*}. Denoting Ai and A 2 the convex cr(X*, Ai)-lower semicontinuous regularizations 
of Ai and A 2 , (|4.28|) implies the identity 

Ai = A 2 . (4.29) 

Usual results about convex conjugation tell us that A*(x) = sup^g^. {(x, u) — Ai(cj)} = 
sup(-D 2 ) and the above supremum is attained at uj if and only if uj G dx*h\{x). This is 
the attainment statement in Proposition !4.2Ul -b. 

4.5. Dual representation of the minimizers. We keep the framework of Diagram 2 
and derive the KKT relations in this situation. The Lagrangian associated with F 2 = Fi 
and Diagram 2 is for any £ G C%, uj G Xf, 

K 2 (£,uj) = inf {(x,u) + $* 1 (£) + 5c 1 (T£ + x)}, 

= <5>l(£)-(T£,u)+ mf(x,u). 

Proposition 4.30. Assume that (H§), (H T ) and (He) hold. 



34 



CHRISTIAN LEONARD 



(a) Any £ G C\ is a solution to (Pi) if and only if there exist some to G X* such that the 
following three statements hold 

(1) Tie C 

(2) (T£, Q) < (x, Q) for all x G d 

(3) and the following representation formula holds 

I e d Cl ®2(T*u) (4.31) 

More, these three statements hold if and only if: £ is solution to {Pi), Co is a solution 
to {D 2 ) and inf(Pi) = sup(P2)- 

Statement ^4-31\ ) is equivalent to the Young's identity 

+ $ 2 (T 2 *cj) = (Tl, Q). (4.32) 

(b) (Assumption {He) is useless here). Any £ G L\ is a solution to {P x ) if and only if 
Tl = x and there exists some Co G X* such that \4-31\l or equivalently \4.32\ ) holds. 
More, this occurs if and only if: £ is a solution to (P), Co is a solution to (Pf) with 
x := Tl and inf(P 5 ) = sup(Pf). 

Proof. This proof is an application of Theorem IA.8I Under the general assumptions (P$), 
{H T ) and {H c ), we have seen at Proposition [OH that the dual equalities (|4.23|) and (|4.26|) 
hold true. In both situations (a) and (b), {£,Co) is a saddle-point; all we have to do is to 
translate the KKT relations (TO0|) and (IXTTJ) . 

• Proof of (a). With K 2 as above, (TOO]) and (TOTl) are d e K 2 {£, Co) 3 and d u (-K 2 ) {£, Co) 3 
0. Since —(T£,u) is locally weakly upper bounded as a function of uo around Co and 
as a function of £ around £, one can apply (Rockafellar, [7j, Theorem 20) to derive 
d t K 2 {£,Co) = d<&*{£) -T*lo and d u (-K 2 )(I,u>) = d{-inf x€Cl (x,-)) +T£. Therefore the 
KKT relations are 

T^Co G d^*{£) (4.33) 
-Tl G d(6*_ Cl )(Q) (4.34) 

where 5*_ Ci is the convex conjugate of the convex indicator of —C\. 

As a convex conjugate, $* is a closed convex functions. Its convex conjugate is $2- 

Therefore (J4.33)) is equivalent to the following equivalent statements 

£ G o^ 2 {t;cj) 

$*(£) + $ 2 {T;uj) = (£,T*cu) 

Similarly, as a convex conjugate 5* c is a closed convex functions. Its convex conjugate is 
5_Ci where C\ stands for the a(Xi, ^*)-closure of C\. Of course, as C\ is o{X\, 3^i)-closed 
by hypothesis {He), it is a fortiori <j{X\, Af*)-closed, so that C\ = C\. Therefore (|4.34|) is 
equivalent to 

5 Cl {T£) + Sl Cl {u) = (-T£,u). (4.35) 
It follows from (|4.35j) that d^iTi) < 00 which is equivalent to T£ G C\. 
Now (|4.35|) is —(T£,Co) = 5*L Ci {Co) = — inf xeCl ( x , Co) which is (T£,Co) = xnf xe c 1 (x,u)). This 
completes the proof of (a). 

• Proof of (b). This follows directly from (a) with x = T£ and C\ = {x}. □ 
Remark 4.36. Thanks to Proposition I4.12k l. ()4.32|) leads us to 

A{{x) + A 2 {oo) = (x,Q) (4.37) 

for all x G domAJ and all Co G X* solution to (Pf). By Young's inequality: A 2 {x) + 
A 2 {co) > (x,u>) and the identities (PHI) . (Q7|) . we see that A 2 {to) > A 2 {to). But, the 
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reversed inequality always holds true. Therefore, we have A 2 (u;) = A 2 (uj). This proves 
that A 2 = A 2 on domA2. 

Proposition 4.38. Assume that (H$), (H T ) and (He) hold. Any solution uj of (D 2 ) or 
(D2) shares the following properties 

(a) uj stands in the o~(X( , X\)-closure o/domAi. 

(b) T 2 *ui stands in the o '(£*, C\)- closures o/Tj* (domAi) and dom$. 

(c) For any x Q G Xi, let us denote jo Xo and J-d Xo the gauge functionals on X\ of the 
convex sets D Xo and —D Xo where D Xo — {x G X\\ A\{x + x) < A^(x Q ) + 1}. 

- Let uj be any solution of (D 2 ). Then, for any x Q in C H icordomA^, Uj is 
jD Xo -upP er semicontinuous and j-o Xo -lower semicontinuous at 0. 

- Let uj be any solution of (-D 2 ) with x G icordomAp Then, uj is jD x -upP er 
semicontinuous and j^n^-lower semicontinuous at 0. 

Proof. • Proof of (a). Because of (|4.37|) . we have uj G domA 2 . As A 2 < A 2 and Ai = A 2 
(see f!4.29|) ). we obtain uj G domAi which implies (a). 

• Proof of (b). It follows from (a) and the continuity of T 2 *, see Lemma l4.11 d that T 2 *a) 
is in the o~(£l, £i)-closure of T^domAi). On the hand, T^uj G dom$ 2 and $ 2 is the 
c(C\, £i)-closed convex closure of $. It follows that T 2 *o> is in the cr(Cl, £i)-closure of 
dom $. 

• Proof of (c). Let uj G argmax(D 2 ). By f!4.18|) and ()4.24|) . for all x G X t and any x a G C\, 

(-uo,x) < </?i(x)-<^i(0) < Al(x o -x)-ip(0) < Aftx -x). It follows that (uu,x) < a{(x ) + 1 
for all x G D Xo . This implies that for all x G Xi, (uj,x) < [1 + A\(x )]iD Xo (x). Since 
]d{— — j-D(%), we finally obtain 

-[l + Al(x )]j. Dxo (x) < (Q,x) < [l + A* 1 (x )}j Dxo (x),\/xeX l 

for any i„ 6 Ci, which is the desired result. Choosing x Q in C\ fl icordom A* implies that 
jo Xo is a nondegerate homogeneous functional. 

The second case where uj G argmax(Df ) is a specialization of the previous one. □ 

Appendix A. A short reminder about convex minimization 

To quote easily and precisely some well-known results of convex minimization while 
proving our abstract results at Section we give a short overview of the approach to 
convex minimization problems by means of conjugate duality as developed in Rockafellar's 
monograph jjj. For complete proofs of these results, one can also have a look at the 
author's lecture notes 0. 

Let A be a vector space and / : A — > [—00, +00] an extended real convex function. We 
consider the following convex minimization problem 

minimize f(a),a G A (V) 

Let Q be another vector space. The perturbation of the objective function / is a function 
F : A x Q ->• [-00, +00] such that for q = G Q, F(-,0) = /(•)■ The problem (V) is 
imbedded in a parametrized family of minimization problems 

minimize F(a, q),a G A (V q ) 

The value function of (V q ) q€ Q is 

Lp(q) — inf(V q ) = inf F(a,q) G [—00, +00], q G Q. 
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Let us assume that the perturbation is chosen such that 

F is jointly convex on A x Q. (A.l) 

Then, (V q ) q( zQ is a family of convex minimization problems and the value function (p is 
convex. 

Let B be a vector space in dual pairing with Q. This means that B and Q are locally 
convex topological vector spaces in separating duality such that their topological dual 
spaces B' and Q' satisfy B' = Q and Q' = B up to some isomorphisms. The Lagrangian 
associated with the perturbation F and the duality (B, Q) is 

K(a,b) = inf {(b, q) + F(a, q)}, a G A,beB. (A.2) 

Under (jA.l|) . K is a convex-concave function. Assuming in addition that F is chosen such 
that 

q i— > F(a, q) is a closed convex function for any a G A, (A. 3) 

one can reverse the conjugate duality relation (|A.2|) to obtain 

F(a,q) = sup{K(a,b) - (b,q)},Va e A,qeQ (A.4) 
fees 

Introducing another vector space P in separating duality with A we define the function 

G(b,p) = inf {if (a, 6) - (a,p)},b G B lP eP. (A.5) 

This formula is analogous to (jA.4|) . Going on symmetrically, one interprets G as the 
concave perturbation of the objective concave function 

g(b)±G(b,0),beB 

associated with the concave maximization problem 

maximize g{b),b G B {T>) 

which is the dual problem of (V). It is imbedded in the family of concave maximization 
problems (V p ) p( zp 

maximize G(b,p), b G B (V p ) 

whose value function is 

7(p) = sup G(b,p),p G P. 
Since G is jointly concave, 7 is also concave. We have the following diagram 

t(p) f( a ) 

( P , A ) 

G(b,p) K(a,b) F(a,q) 

( B , Q } 

The concave conjugate of the function / with respect to the dual pairing (Y, X) is 
f*{y) = ^x{(y,x) — f{x)} and its superdifferential at x is df(x) = {y G Y;f(x') < 
f(x) + (y,x' - x)}. 

Theorem A. 6. We assume that (P, A) and {B, Q) are topological dual pairings. 

(a) We have sup(X>) = ip**(0). Hence, the dual equality inf('P) = sup(P) holds if and 
onlyif<p(0) = <p**(0). 
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(b) In particular, 

• F is jointly convex ) 

• (f is lower semicontinuous at > =>> inf('P) = sup(D) 

• sup(£>) > — oo J 

(c) If the dual equality holds, then 

argmax g = — <9(/?(0). 
Let us assume in addition that HA . 1)) and \A.3\) are satisfied. 

(a') We have inf(P) = 7**(0). Hence, the dual equality inf('P) = sup(X>) holds if and 

onlyif 1 (0) = 1 ii (0). 
(b') In particular, 

• 7 is upper semicontinuous at 

• inf(P) < +oo 

(c') If the dual equality holds, then 

argmin / = —8^(0). 

Definition A. 7 (Saddle-point). One says that (a,b) e A x B is a saddle-point of the 

function K if 

K(a,b) < K(a,b) < K(a,b), \/aeA,beB. 

Theorem A. 8 (Saddle-point theorem and KKT relations). The following statements are 
equivalent. 

(1) The point (a, b) is a saddle-point of the Lagrangian K 

(2) f{a)<g{b) 

(3) The following three statements hold 

(a) we have the dual equality: sup(X>) = inf^T 7 ), 

(b) a is a solution to the primal problem (V) and 

(c) b is a solution to the dual problem (V). 

In this situation, one also gets 

sup(P) = mi(V) = K(a,b) = f(a) = g(b). (A.9) 

Moreover, (a, b) is a saddle-point of K if and only if it satisfies 

d a K(a,b) 3 (A. 10) 

d b K(a,b) 3 (A.ll) 

where the subscript a or b indicates the unfixed variable. 

Appendix B. Gauge functionals associated with a convex function 

The following result is well-known, but since I didn't find a reference for it, I give its 
short proof. 

Let 6 : S — » [0, oo] be an extended positif convex function on a vector space S, such 
that 0(0) = 0. Let S* be the algebraic dual space of S and 9* the convex conjugate of 6 : 

9*{r) = sup{(r,s) -9(s)},r G S*. 

ses 



I inf(P) = sup(P) 
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It is easy to show that 9* : S* -> [0, oo] and 0*(O) = 0. We denote Cg = {9 < 1} and 
Ce* = {$* < 1} the unit level sets of 9 and 9*. The gauge functionals to be considered are 

jg(s) = inf{a > 0; s G aCg} = inf{a > 0; 9(s/a) < 1} G [0, oo], s G S 1 . 

>(r) = infja > 0; r G aC^*} = inf{a > 0; 9*(r / a) < 1} G [0, oo], r G S 1 *. 

As belongs to Cg and Cg* , one easily proves that jg and jg* are positively homogeneous. 
Similarly, as Cg and Cg* are convex sets, jg and jg* are convex functions. 

Proposition B.l. Let 9 : S — > [0, oo] fre an extended positif convex function on a vector 
space S, such that 9(0) = as above. Then for all r G S*, we have 

\je*{r) < 8* c (r) = sup(r,s) < 2j *(r). 
2 sec e 

We also have 

cone dom^* = dom jg* = dom<5^ fl 
where cone dom (9* is the convex cone (with vertex 0) generated by dom#*. 

Proof. • Let us first show that Sq (r) < 2jg*(r) for all r G S*. If jg*(r) > 0, then for all 
s G Cg, (r,s) = (r/jg*(r),s) Je *(r) < [9(s) + 9*(r/jg*(r))] Je *(r) < (1 + l) Je *(r). 
If jg*(r) = 0, then 9*(tr) < 1 for all t > 0. For any s G Cg, we get (r,s) = j(tr,s) < 
\[9(s) + 9*(tr)} <2/t. Letting t tend to infinity, one obtains that (r, s) < 0. 

• Let us show that jg*{r) < 2<5£ e (r). If S^ e (r) = oo, there is nothing to prove. So, let us 
suppose that b~c e (r) < oo. As G Cg, we have 5q • (r) > 0. 

First case: ^c , e ( r ) > 0- ^or all s G 5 and e > 0, we have s/[jg(s) + e] G Cg. It 
follows that (r/6* Cg {r), s) = (r, s/[j e (s) + e]}^ < 5* Ce {r)^ = 3e (s) + e. Therefore, 
(r/5* Cg {r),s) <jg(s), for all s G 5. 

If s doesn't belong to Cg, then jg(s) < 9(s). This follows from the the assumptions on 9 : 
convex function such that 9(0) = = min 9 and the positive homogeneity of jg. Otherwise, 
if s belongs to Cg, we have jg(s) < 1. Hence, (r/S^ (r), s) < max(l, 9(s)),Ws G S. On the 
other hand, there exists s Q G S such that 9*(r/[25* Ce (r)}) < (r /[25* Cg (r)], s Q ) - 9(s Q ) + 1/2. 
The last two inequalities provide us with 9*(r /[25* Cg (r)}) < \ max(l, 9(s Q )) — 9(s Q ) + \ < 1 
since 9(s Q ) > 0. We have proved that jg*(r) < 25^< g (r). 

Second case: ^c 9 ( r ) = 0- ^ e have (r,s) < for all s G Cg. As dom 9 is a subset of 
the cone generated by Cg, we also have for all t > and s G dom#, (tr, s) < 0. Hence 
(tr, s) - 9(s) < for all s G S and 9*(tr) < 0, for all t > 0. As 9* > 0, we have 9*(tr) = 0, 
for all t > 0. It follows that jg*(r) = 0. This completes the proof of the equivalence of jg* 
and 5* r . 

• Finally, this equivalence implies that dom jg* = dom^ and as #*(0) = we have 
G dom#* which implies that cone dom#* = dom jg*. □ 
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